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ABSTRACT 

The  organizational  structure,  tasks,  and  system  configu- 
ration  of  the  Indonesian  Army  Data  Collecting  and  Processing 
Service  (DISPOLL AHT AE)  are  presented  briefly  in  crder  to 
provide  a  background  for  succeeding  iiscussions.  Features 
of  available  data  dictionary  systems  (DDS)  and  the  initial 
design  of  a  data  dictionary  for  current  applications  at 
DISPULLAHTAD  are  presented.  Finally,  based  on  that  initial 
design,  a  recommendation  for  database  management  system 
(DBMS)  inplementation  is  discussed. 
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I.  IS TROD OCT ION 


As  a  young  and  developing  information  system  organiza¬ 
tion,  the  Indonesian  Army  Data  Collecting  and  Processing 
Service  (DISPULL  AHTAD)  has  a  tremendous  proliferation  of 
application  files.  It  is  no  surprise  that  there  is  much 
redundancy  of  data  and  efforts.  Data  redundancy  vaster  a 
limited  resources,  and  furthermore,  it  raises  the  problem  of 
inconsistent  data,  that  is,  the  same  element  of  data  having 
different  values  within  different  files.  The  implementation 
of  database  management  system  (DBMS)  could  handle  this 
problem  by  providing  more  control  and  more  effective 
management  of  data. 

Cn  cne  hand,  the  information  generated  electronically 
becomes  more  and  more  in  demand  to  the  point  where  it  has 
become  a  critical  issue  for  the  Indonesian  Army.  Cr  the 
other  hand,  the  personnel  generating  and  maintaining  this 
information  move  dynamically  because  of  requirements  for 
military  tour  of  duty  and  tour  of  area.  This  '  situation 
creates  problems  in  keeping  accurate  and  up  to  date  informa¬ 
tion.  Even  though  the  manual  documentation  is  always  done 
properly,  this  is  not  always  adequate.  It  is  often  the  case 
that  many  applications  are  highly  dependent  up  on  the 
personnel  responsible  for  such  applications.  Standardized 
and  centralized  documentation  is  a  "must",  especially  when  a 
DBMS  is  implemented.  In  this  regard,  the  data  dictionary  is 
a  powerful  vehicle  that  supports  such  documentation. 

According  to  Dolk  [Ref.  1],  "a  data  dictionary  is  a 
collection  of  an  enterprise's  meta-data  designed  as  cne  or 
more  databases  which  can  be  retrieved  and  analyzed  using 
standard  database  management  system  capabilities".  This 
will  be  discussed  further  in  a  subsequent  chapter,  and  will 
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he  considered  as  a  lasis  in  choosing  the  most  appropriate 
Da  MS  to  he  implemented  by  DISPULL AHTAD. 

The  organizational  structure  of  DISPOLLAHTAD,  its  system 
configuration,  and  its  current  various  applications  will  he 
describes  briefly  in  order  to  provide  a  background  for  the 
succeeding  chapters.  A  discussion  of  database  management 
system  and  a  recommendation  of  the  most  appropriate  D  3  ‘-IS  to 
he  implemented  comprises  the  last  chapter. 


II.  INDONESIAN  ARMY  DATA  COLLECTING  AND  PROCESSING  SERVICE 

(DISPOLLjy|TAD) 

A.  ORGANIZATION,  TASK,  AND  SYSTEM  CONPIGOR ATION 

D IS? DLL AHT AD  is  an  acronym  in  the  Indonesian  language 
that  stands  for  the  Indonesian  Army  Data  Collecting  an  1 
Processing  Service.  It  was  initiated  in  fiscal  Year 
1973/1974  and  formally  organized  in  Fiscal  Year  1975/1976 
[Ref.  2].  DISPULLAHTAD  is  located  in  the  Indonesian  Army 
Headquarter  -  Jakarta,  the  capital  city  of  the  Republic  of 
Indonesia. 

The  DISFULL AHTAD ' s  main  task,  is  to  provide  all  informa¬ 
tion  processed  electronically  for  all  organizational 
elements  of  the  Army  requiring  the  information  [Ref.  2],  In 
order  to  be  able  to  accomplish  this  task,  DISPULLAHTAD  is 
equipped  with  several  computer  configurations.  As  of  1964, 
these  include  an  IBM  System  4341,  an  IBM  System  370,  several 
IBM  System  3740s,  and  several  TRS-80s. 

In  the  lower  organizational  level  such  as  Military  Area 
Commands  (KCDAM) ,  Army  Finance  Service  (JAHKUAD),  Army 
Administrative  and  Personnel  Service  ( JANMINPERS AD) ,  Army 
Development  and  Educational  Command  (KOBANGDIKLAT) ,  etc., 
each  has  its  own  Data  Processing  Service  and  it  is  called  as 
PULLAHTA  KOTAM A/LAKPCS  or  PULLAHTA  for  snort.  A  simplified 
organizational  structure  of  the  Indonesian  Army  is  presented 
as  figure  2.1  and  the  position  of  both  DISPULLAHTAD  and 
FULLAHTAs  can  be  clearly  visualized. 

Each  PULLAHTA  has  two  roles:  first  to  process  and 
provide  all  pertinent  information  requested  by  the  organiza¬ 
tion  to  which  it  is  attached,  and  secondly,  to  provide  data 


1 •  FOLIAHTAs  inside  Java  Island 

FULlAHTAs  that  belong  to  Military  Area  Command  ir. 
Java  island  are  equipped  with  an  IBM  System  4331  connected 
to  the  I EM  System  4341  at  DISPOLLAHTAD  in  "online"  mode  via 
dedicated  public  telephone  lines.  This  is  the  first  stage  in 
networking  all  PULLAHTAs  throughout  the  country.  The  data 
interchange  is  done  electronically  through  these  dedicated 
lines. 

2 .  FDllAHTAs  ou  tside  Java  Island 

POLLAHTAs  sited  outside  Java  island  are  equipped 
with  an  IBM  System  3740  and  currently  work  in  "off  line" 
mode.  Eventually  they  will  be  connected  to  DISPULL AHTAD  via 
dedicated  public  telephone  lines.  The  data  interchange  is 
done  manually  using  floppy-disks  transported  via  airline  and 
it  requires  one  to  two  days  for  the  data  to  reach  its 
destination. 

B.  APPLICATIONS 

All  applications  done  by  DISPOLLAHTAD  are  in  crier  to 
fulfill  its  task  of  electronically  providing  information 
needed  by  the  Indonesian  Army.  These  applications  are  sepa¬ 
rated  into  three  kinds  of  Management  Information  Systems 
[Ref.  3]  : 

1 .  Administration  Management  Informa  tic  System 

This  category  includes  applications  in  finance, 
logistics,  personnel,  and  all  applications  pertinent  to 
development,  education,  and  corps/specialty. 

2 •  Military  Management  Information  Sy stem 

The  applications  of  intelligence  ar.d  security, 
territorial,  communication  and  electronics,  and  organiza¬ 
tion,  operation,  and  training  are  included  in  this  category. 


3  •  Planning  and  Controlling  Man  a  gemer.t  I  r.f  era  atior. 

System 

Three  kinds  cf  applications  are  included  in  this 
category:  planning  and  budgeting,  auditing  and  control,  and 
command,  control,  and  communication. 

C.  DATA  REDUNDANCY  E50BLEHS 

There  are  many  possible  data  redundancies  within  those 
applications.  For  instance  consider  data  about  name,  rank, 
S3N,  corps,  occupation,  etc.,  that  belong  to  an  individual 
assigned  as  Intelligence  Officer  in  a  Territorial  Unit.  His 
data  will  appear  in  at  least  four  different  files: 
personnel  file,  payroll  file,  intelligence  file,  and  terri¬ 
torial  file.  If,  fer  instance,  there  is  a  change  to  just 
one  of  those  data  elements,  redundant  effort  is  required  to 
update  all  those  four  files  and  a  high  level  cf  data 
inconsistency  may  result. 

It  happens  many  times  that  top  level  management  detects 
data  inconsistency  in  two  different  reports  generated  by 
DISPUII AHTAD  (e. g.  the  total  number  of  personnel  appears 
differently  in  the  personnel  report  and  the  payroll  report) . 
This  has  become  very  annoying,  and  tremendously  reduces 
credibility  in  the  computer  system.  In  this  regard,  ar. 
effort  should  be  made  to  eliminate  the  problem  and  one  way 
of  doing  that  is  by  designing  and  implementing  a  data 
dictionary  system  (DDE)  in  concert  with  a  DBMS. 

D.  STAGED  DEVELOPHENT  APPROACH 

The  design  of  any  information  system  is  the  most  diffi¬ 
cult  and  critical  step.  It  should  be  done  with  great  care 
and  full  awareness.  As  suggested  by  Sprague  and  Carlson 
[Ref.  4],  there  are  three  different  approaches  in  initiating 
an  information  system: 


•  Quick-Hit  approach. 

This  approach  should  be  done  if  there  is  no  ciar.i.iica- 
tion  whether  such  an  information  system  is  needed  or  not, 
tut  there  is  a  recognized  high  payoff  for  initiating  the 
system.  This  approach  requires  developing  the  system  in  the 
most  beneficial  area,  capturing  the  benefits,  and  then 
considering  what  to  dc  next. 

•  Staged  Development  approach. 

This  approach  is  done  by  developing  the  system  in  the 
most  teneficial  area  as  in  the  quick-hit  approach,  tut  with 
some  advanced  and  clear  planning.  Therefore,  part  of  the 
effort  in  developing  the  first  system  can  be  reused  in 
developing  the  second.  This  approach  is  very  appropriate  for 
initiatirg  an  information  system  clearly  supported  by  top 
management,  but  with  limited  available  resources. 

•  Complete  System  approach. 

This  approach  requires  the  longest  development  time  and 
highest  development  ccsts  before  any  benefits  are  attained. 
Before  building  any  part  of  the  system,  a  full-service 
systen  generator  and  the  organizational  structure  for 
managing  it  must  be  developed  first.  In  this  regard,  this 
approach  represents  the  most  risky  option. 

For  DISPULL AHTAD ,  it  is  anticipated  that  top  management 
will  strongly  support  the  implementation  of  a  DBMS,  there¬ 
fore  the  Quick-Hit  approach  is  not  necessary.  On  the  ether 
hand,  limited  computer  resources  makes  a  Complete  System 
approach  infeasible,  too.  Hence,  the  most  appropriate 
approach  is  the  Staged  Development  approach.  In  the  Staged 
Development  approach,  identifying  the  functional  area  where 
there  is  a  reason  to  expect  the  highest  pay-off  in  starting 
the  project  is  a  crucial  thing.  Using  the  hignest  volume  of 
transaction  as  the  criteria,  and  by  evaluating  the  trans¬ 
actional  data  gathered  between  April  1983  and  December  1983 


(see  Figure  2.2),  there  are  three  applications  having  high 
transactional  volume:  personnel,  payroll,  and  finance  report 
applications.  Only  the  personnel  and  payroll  applications 
have  a  master  file  which  is  maintained  and  used  continu¬ 
ously.  Besides  that,  these  two  applications  are  the  most 
crucial  in  maintaining  personnel  morale  and  the  most  often 
used  in  relation  to  the  personnel  management  task. 

Based  on  these  evaluations,  data  used  by  the  personnel 
and  payroll  applications  will  be  the  first  database  to  be 
implemented  by  DIS  FOIL ART AD.  These  applications  include 
personnel,  payroll,  intelligence  personnel,  and  territorial 
personnel.  It  is  also  implied  that  the  discussion  on  the 
DDS  and  DBMS  will  be  limited  to  those  applications. 

The  plan  for  this  staged  development  approach  are: 

1.  Initial  design  of  DD  covering  personnel,  payroll, 
intelligence  personnel,  and  territorial  personnel  applica¬ 
tions  . 

2.  Implementation  of  personnel  and  payroll  database. 

3.  Design  of  DD  for  all  data  used  by  the  rest  of  appli¬ 
cations  excluded  at  the  first  stage. 

4.  Implementation  of  other  databases. 


III.  DATA  DICTIONARY 

A.  GENERAL 

The  revolutionary  change  in  computer  technology  has 
created  another  challenge  on  hew  to  organize  and  manage  the 
very  large-scale  databases  made  possible  by  the  combination 
of  database  management  systems  (DBMS)  and  powerful  new  hard¬ 
ware  systems.  The  need  to  control  the  enterprise's  data 
becomes  critical  due  to  the  proliferation  of  microcomputers 
that  trigger  more  and  more  applications  which  in  turn 
creates  redundancy  and  data  inconsistency  problems.  At  the 
same  time,  the  number  of  microcomputer  users  demanding 
direct  access  to  the  enterprise  data  is  also  increasing. 
This  direct  access  to  large  and  complex  databases  again 
creates  a  problem  of  how  to  "coordinate"  and  control  these 
complex  information  structures. 

Data  redundancy,  data  inconsistency,  and  the  need  to 
control  the  enterprise's  data  lead  to  the  design  and  imple¬ 
mentation  of  database  systems.  The  database  environment 
itself  assumes  an  architectural  plan  designed  to  minimize 
redundancy  and  to  emphasize  accessibility.  It  assumes 
logical  and  physical  structures  aimed  at  separate  objec¬ 
tives.  It  also  assumes  that  individual  file  may  serve  many 
different  applications.  All  of  this  is  far  too  much 
complexity  to  be  managed  without  precise  and  up-to-date 
documentation  and  control.  The  data  dictionary  is  designed 
to  define  all  appropriate  aspects  of  the  enterprise's  data, 
so  that  it  can  be  used  as  a  tool  to  control  and  manage  the 
database  system  no  matter  how  great  its  size  and  hew  complex 
its  structures. 


B.  INFCRMATION  RESOURCE  MANAGEMENT  (IRM) 


The  concept  of  I  EM  is  that  information  is  a  vital  enter¬ 
prise  asset  that  should  he  invested  in,  and  used  like  ether 
resources  [Ref.  5]. 

IRM  is  the  task  cf  managing  information  resources  such 
as  data,  processes,  users,  software,  and  hardware  in  an 
integrated  and  coordinated  manner.  IRM  includes  all  manage¬ 
ment  aspects  of  the  information-related  operations  of  an 
organization,  such  as  policy  formulation,  resource  alloca¬ 
tion,  implementation,  and  control. 

A  definition  of  IFM  was  formulated  at  a  workshop  on  Data 
Dictionary  Systems  and  Information  Resource  Management  spon¬ 
sored  hy  the  Association  for  Computing  Machinery  and  the 
National  Bureau  of  Standards  in  1980: 

Information  Resource  Management  is  whatever  peliev, 
action,  or  procedure  concerning  information  (both  auto¬ 
mated  and  non-autemated)  which  management  establishes 
that  serve  the  overall  current  and  future  needs  of  the 
enterprise.  Such  policies,  etc.,  would  include  consid¬ 
erations  of  availability,  timeliness,  accuracy,  integ¬ 
rity,  privacy,  securit7,  auditability,  ownership,  use, 
and  cost-effectiveness*. 

This  definition  cf  IRM  was  chosen  to  emphasize  the 
enterprise-wide  nature  of  planning  and  execution  of  informa¬ 
tion  policies,  acticrs,  and  procedures  in  order  that  data 
can  be  treated  as  a  true  resource.  It  also  reflects  the 
primary  shift  of  data  processing  uses  from  processing- 
centered  design  methodologies  to  data-centered 
methodologies. 

1 •  Data  Diction  ary  as  t  he  Tool  of  IRM 

Cne  of  the  problems  encountered  in  IPM  is  the  vast 
amount  cf  data  about  information  resources  required  to 
managed  the  enterprise  data,  together  with  the  very  complex 
and  numerous  relationships  existing  between  them.  This  is 


precisely  the  sort  of  tasK  tnat  a  Data  Dictionary  system  can 
he  made  to  do,  provided  that  it  has  been  conditioned  to  know 
how  to  deal  not  just  with  data  entities  or  process  entities, 
tut  with  the  entire  range  of  information  resource  entities. 

2  .  Cat a  Dictionary 

A  data  dictionary  is  a  collection  of  meta-data  (data 
about  the  enterprise  data)  that  could  consist  of:  the  name 
of  data  (including  its  synonyms  and/or  homonyms) ,  the  loca¬ 
tion  of  data,  a  description  of  the  meaning  of  data,  the 
relation  between  data,  how  the  data  is  used,  who  is  respon¬ 
sible  for  the  data,  the  source  of  the  data,  etc. ,  in  short, 
a  store  cf  all  the  appropriate  information  about  the  data. 

Recently,  there  has  been  a  trend  towards  using  data 
dictionary  to  include  the  following  functions: 

a.  Definition  of  other  data  constructs  such  as 

records  and  files. 

fc.  Definition  of  processes  such  as  programs  or 

manual  processes. 

c.  Definition  of  data  users  whether  individuals  or 
organizational  entities. 

Along  with  these  definitions,  the  lata  dictionary 
also  began  to  be  used  to  document  the  cr oss- r ef  erer.ces 
between  them  and  to  record  their  usage  and  organizational 
responsibilities. 

3 .  Cata  Dictionary  System  (DPS) 

A  Data  Dictionary  System  is  a  combination  of  soft¬ 
ware  and  procedures  that  aid  an  enterprise  in  setting  up  and 
maintaining  its  complex  structure  of  data  resources.  The 
software  itself  may  be  produced  in-house  or  acquired  from 
software  vendors.  Fcr  the  following  three  reasons,  it  is 
often  better  to  purchase  instead  of  building  it  in-house: 


a.  Design  ar.d  Implementation 

The  task  of  designing  and  implementing  a  DCS, 
even  one  of  modest  functionality,  is  definitely  a  non¬ 
trivial  one.  There  exists  good  potential  that  the  magnitude 
of  the  task  will  te  underestimated  and  that  greater 
resources  will  be  needed  than  those  originally  estimated. 

L.  Gaining  a  Success 

If  the  use  of  DDS  is  to  be  at  all  successful, 
the  software  itself  must  conform  to  high  standards  of 
quality  assurance.  The  use  of  the  same  software  at  many 
other  installations,  as  is  the  case  with  a  commercial 
package,  aids  in  the  early  discovery  of  possible  software 
errors  and  their  corrections. 

c.  Technology  Progress 

There  are  good  reasons  for  assuming  that  CDS 
technology  will  continue  to  progress  and  that  substantial 
enhancements  will  significantly  increase  the  usefulness  and 
value  of  the  DDS,  and  it  will  be  difficult  for  an  in-house 
system  tc  keep  pace. 

There  are  several  DDS  commercially  available 
now,  such  as  DB/DC  Data  Dictionary  of  IBM,  DR  TAM AN AG  EE  by 
MSP  Inc.,  Integrated  Data  Dictionary  ( IDD)  by  Culiinet 
Software  Inc.,  DATA DICTIONAR Y  by  Applied  Data  Research  Inc., 
Extended  Data  Dictionary  (XDD)  by  Intell  Systems  Corp. ,  uCC 
TEN  by  University  Computing  Company,  and  Data  Control  System 
(DCS)  by  Cincom  Systems  Inc. 

In  the  following  section,  features  that  one 
could  expect  tc  find  on  those  available  DDS  will  be 
discussed.  It  must  be  pointed  out  that  none  of  the  avail¬ 
able  DDSs  mentioned  above  will  necessary  have  all  explained 
features,  and  there  is  no  such  implication  that  all  of  these 
features  are  required  in  all  DDS  applications. 
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c. 


FEATURES  OF  A  DATA  DICTIONARY 


There  are  several  issues  will  be  iiscussei  here,  such  as 
Icple  mentation  and  Architectural,  Data  Dictionary  (DD)  ar.d 
DD  Schema,  Extensibility  Facilities,  Status  Facilities, 
Dictionary  Commands,  Eridge  Facilities,  and  Data  Dictionary 
System  Security. 

1 .  Implementation  and  Architectural  Issues 
a.  The  Relationship  Detween  DD3  and  DEWS 

The  primary  purpose  of  a  DBMS  is  to  manage  data, 
whereas  the  primary  purpose  of  a  DD S  is  to  manage  meta-data. 
Therefore,  it  is  clear  that  there  is  a  very  little  overlap 
between  these  two,  in  fact  they  are  complementary ;  both 
functions  are  required  for  proper  management  of  information 
resources. 

Some  of  the  functions  a  DDS  will  perform  are  in 
support  cf  one  or  more  DBMSs.  This  is  to  be  expected  as  the 
DDS  will  manage  all  meta-data,  including  meta-data  where  the 
actual  instances  of  data  are  stored  in  a  database,  which  in 
turn  is  being  managed  by  a  DBMS.  An  element  required  for 
this  latter  function,  the  DBMS’s  management  of  data  in  data¬ 
bases,  is  the  knowledge  on  the  part  of  the  DBMS  of  certain 
meta-data  of  the  databases  which  are  required  by  the  DBMS  in 
order  for  it  to  do  its  processing.  This  meta-data  is 
commonly  referred  to  as  the  DBMS-Directory,  and  it  should  be 
clear  that  this  potentially  is  one  area  of  overlap  between 
the  DCS  and  DBMS.  In  this  sense,  it  is  preferable  to  design 
a  DDS  prior  to  the  OEMS  implementation  rather  than  to  build 
a  DDS  that  has  to  be  fitted  toward  an  existing  DBMS.  This 
reason  together  with  an  existing  method  of  implementing  a 
EDS  as  one  of  the  EEMS's  applications  may  explain  why  in 
this  thesis  designing  the  dictionary  is  done  prior  tc  the 
design  of  the  databases. 
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t.  The  Method  of  DDS  Implementation 

It  is  preferable  to  design  a  data  dicticr.ary 
prior  to  the  implementation  of  the  DBMS.  On  the  other  hand, 
the  implementation  of  tnat  designed  data  dictionary  as  a 
complete  DDS  is  a  gcod  candidate  to  be  one  of  the  DBMS 
applications,  and  indeed  a  number  of  existing  DDS  are  imple¬ 
mented  in  this  manner.  But  this  is  not  a  sin3le  option,  the 
implementation  of  DDS  can  be  either  : 

•  DBMS-dependent  system. 

This  is  a  DDS  that  uses  a  D3MS  in  its  implemen¬ 
tation. 

•  Free-standing  system. 

A  DDS  that  doesn’t  use  a  DBMS  in  its  implementa¬ 
tion  is  considered  tc  be  included  in  this  category. 

There  is  no  ultimate  answer  as  to  whether  a 
free-standing  or  DBMS-dependent  DDS  is  tne  best.  There  are 
both  pros  and  cons  tc  this  question,  and  these  depend  cn  the 
enterprise's  specialized  circumstances,  such  as  whether  the 
enterprise  implements  a  DBMS  or  not,  and  whether  it  uses 
multiple  DPMSs  or  a  single  D3MS  for  its  databases. 
Int er prise  (s)  that  have  no  intention  to  implement  a  DEMS  tut 
DDS  will  give  a  favor  toward  a  free-standing  system.  Cn  the 
ether  hand,  the  enterprise  (s)  having  multiple  DEMS  may 
implement  a  DBMS-dependent  system  and  choose  one  cf  its 
DBMSs  to  implement  it;  or,  they  may  implement  a  free¬ 
standing  system  in  order  to  provide  more  flexible  and  fair 
control. 

Other  considerations  include  the  DDS  security 
and  a  view  that  the  scope  of  DDS  usage  is  substantially 
broader  than  the  DBMS  environment.  With  a  DBMS-dependent 
system,  personnel  familiar  with  the  use  of  the  D3MS  may  find 
it  easier  to  break  the  DDS  security  than  would  be  the  case 
with  a  free-standing  svstem. 


For  the  sake  of  co mpleteness,  there  is  ar. ether 
concept  of  DOS  implementation  method  referred  to  as  inte¬ 
grated  DCS.  This  method  offers  an  elimination  of  overlap 
between  data  dictionary  and  D3f!S-Direct or y  ty  combining 
these  features  into  ere.  The  advantage  gained  by  combining 
these  two  features  is  that  redundancy  of  storing  the 
meta-data  is  eliminated. 

c.  Active  and  Passive  EDS 

In  the  processes  that  reguire  meta-data  for  its 
execution./  there  should  be  a  command  or  series  of  commands, 
representing  some  CE5  functionality  that  produces  the 
required  meta-data.  This  functionality  is  called  dictionary 
interface,  and  there  are  two  kinds  of  such  interface:  active 
and  passive  interface. 

An  active  interface  means  that  all  processes 
that  require  meta-data  will  use  the  most  current  meta-data 
in  the  data  dictionary.  Similarly,  all  processes  which,  in 
the  course  of  their  execution  generate  meta-data  are 
required  to  store  the  generated  meta-data  in  the  data 
diet icnary . 

On  the  other  hand,  the  passive  interface  will 
have  all  that  an  active  interface  has  to  do  as  the  option. 
The  other  option  for  all  processes  that  require  meta-data  in 
its  execution  are  either  retrieve  it  from  data  dictionary  or 
some  other  locations;  or  if  the  process  already  contains  the 
meta-data,  there  exist  an  option  for  the  system  to  check 
whether  or  rot  this  meta-data  is  the  most  current  version  in 
the  data  dictionary.  in  the  case  of  generating  a  meta-data, 
the  process  also  has  an  option  to  store  or  not  to  store  the 
generated  meta-data. 

Therefore,  two  conclusions  can  be  drawn  about 
the  dictionary  interface  : 
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which  art 


(1)  A  DES  nay  have  some  interfaces 
active  and  others  which  are  passive 

(2)  The  fact  that  an  interface  is  active  is  a 
property  not  only  of  the  DOS,  but  also  of  the  overall  system 
of  which  the  DDS  is  a  part. 

2.  Data  Dictionary  and  Data  Dictionary  Schema 

Data  Dictionary  denotes  the  organized  and  structured 
collection  of  meta-data  which  comprises  the  contents  of  the 
CDS.  The  data  dictionary  schema  denotes  the  logical  struc¬ 
ture  of  the  data  dictionary,  in  a  manner  analogous  tc  the 
use  of  same  term  in  the  context  of  a  DBMS. 

The  structural  char acteristics  of  data  dictionary 
and  the  contents  of  data  dictionary  schema  will  determine 
what  kinds  of  meta-data  can  be  stored  in  the  data  dictionary 
ar.d  what  kinds  of  relationships  can  be  established  between 
them.  Some  systems  have  extensibility  facilities  whereby  ar: 
installation  can  customize  the  data  dictionary  schema. 
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A  data  dictionary  has  a  conceptual  similarity  tc  the 
Entity-Relationship- Attribute  model.  Tne  basic  unit  in  the 
data  dictionary  is  a  Dictionary  Entity  or  Entity  for  short. 
Entities  represent  real  world  objects  or  things  about  which 
certain  information  exists  in  the  data  dictionary.  And  the 
information  about  entities  themself,  exists  in  the  form  of 
Attributes  which  generally  denote  the  qualities  or  quanti¬ 
ties  of  properties  of  the  entitle  s.  Finally,  data 
dictionary  also  contain  information  about  Relationships 


between  entities,  ar.d  relationships  nay  have  attributes 
assigned  to  them. 

The  term  Entity-type  is  applied  to  some  entities 
that  have  similarities  among  them.  For  example,  if  a  set  of 
files  is  described  in  the  data  dictionary,  each  file  -will  be 
represented  by  a  distinct  entity.  It  then  becomes  useful  to 
establish  an  Entity- type  called  File  in  the  data  dictionary 
and  to  say  that  all  such  entities  representing  files  have 
the  entity-type  File.  Attributes  of  entities  of  the  same 
type  will  exhibit  a  certain  degree  of  similarity. 
Entity-type  File  will  likely  have  an  attribute  of  what  kind 
of  access  method  used,  and  maybe  another  attribute  shcvir 3 
the  blocking  factor  used.  These  both  access  method  and 
blocking  factor  are  then  called  as  an  Attribute- type  which 
is  associated  with  the  entity-type  File.  Beside  the  entity- 
type  File,  there  will  be  an  entity-type  Record.  The  informa¬ 
tion  would  exist  in  the  data  dictionary  explaining  which 
types  cf  records  are  included  in  a  given  file.  All  such 
relationships  between  these  file  entities  and  their  associ¬ 
ated  record  entities,  then  be  called  as  a  Relationship- type. 
In  conclusion,  the  data  dictionary  schema  would  be  viewed  as 
containing  all  existing  entity- types,  relationship -types, 
and  attribute- types.  Any  one  of  these  three  types  may  also 
be  referred  to  as  a  schema  descriptor. 

Every  entity  in  the  data  dictionary  has  a  primary 
name  which,  depending  on  the  particular  PDS,  will  be  unique 
either  in  the  dictionary  or  within  the  entity-type  to  which 
the  entity  belongs.  Some  systems  may  facilitate  duplicate 
user-supplied  names  by  assigning  them  distinct  sequential 
numbers.  In  this  case,  the  concatenation  of  user-supplied 
name  and  sequential  number  constitutes  the  unrgue  dictionary 
name.  The  allowable  length  of  the  primary  name  should  be 
sufficiently  large  enough  to  convey  the  meaning  of  an  entity 
in  its  primary  name.  It  is  common  that  at  least  seme 


entities  wiu  also  be  known  fcy  other  names.  Such  alternate 
names  are  called  aliases  or  synonyms,  and  the  most  important 
things  are  the  capability  or  the  DOS  for  tracking  them  and 
allowing  access  to  the  data  dictionary  via  these  alternate 
names.  Sometimes  it  is  convenient  if  non-anigue  synonyms 
are  allowed.  To  fulfill  this  requirement,  DDSs  have  facili¬ 
ties  for  tracking  synonyms  either  as  attributes  of  the 
respective  entities,  or  as  separate  entities  related  tc  the 
primary  entity.  Therefore,  it  is  important  that  the  system 
should  be  able  to  recognize  the  context  in  which  the  synonym 
is  used. 

The  attributes  can  be  differentiated  into  seme 
attribute- types,  among  which  are:  Description  =  it  consists 
of  an  English  language  statement  describing  the  meaning  of 
the  entity.  Classification  keywords  =  these  are  attached  to 
the  entities  which  then  can  be  used  for  selective  retrieval 
of  these  ertities.  Audit-attributes  =  these  are  attributes, 
generated  by  the  DDS ,  indicating  the  identification  cf  the 
person  who  created,  the  date  of  creation,  the  identification 
of  last  person  who  modified,  the  date  of  last  modification, 
and  the  total  numbers  of  modification,  all  for  each  entity. 

The  entity  itself  can  be  conveniently  separated  into 
three  entity-types:  Lata,  Process,  and  Usage  entitv-tyres. 


a.  Data  Entity-types 

The  most  common  of  this  type,  listed  with 
typical  attribute- types  and  rela tionship- types  are: 

(1 )  Item/Data  Element .  In  some  systems  the 
lowest  entity-type  is  Item,  which  is  considered  to  be  the 
atomic  unit.  In  other  systems,  the  lowest  entity  nay  be 
Data  Element,  which  in  its  turn  it  may  contain  other  Data 
Elements.  This  is  usually  specified  by  contains  clause, 
which  expresses  the  relationship. 


Commonly  provided  attribute- t ypes  relate 
to  the  physical  characteristics  of  the  Iten/llecient, 
including  distinctions  between  Source,  Target,  Internal 
representations,  and  the  validation  criteria  that  aay  Le 
required  for  the  real-world  instances  of  the  Item/21enent. 

(2)  Group/Record.  Systems  that  recognize  the 
Item  as  an  entity-type  will  contain  Group  as  a  separate 
entity-type,  whereas  systems  that  have  Data  Element  as  an 
entity-type  do  not  have  an  entity-type  for  Groups.  Record 
is  logically  the  same  as  a  Group,  therefore  separate  entity- 
types  fcr  both  of  them  may  or  may  not  exist.  A 
relationship-type  is  provided  to  express  the  structure  of 
the  Grcup/F.ecord .  Commonly  available  attribute- types  relate 
to  the  manner  in  which  the  constituent  elements  are  aligned, 
and  other  physical  characteristics  of  the  entity. 

(3)  File.  Relationship- types  are  provided  to 
express  the  structure  of  the  file.  Attribute-types  relate  to 
the  access  method  used,  blocking  and  labelling  information, 
etc. 

(4)  DBM 2-related  Entity-types.  The  entity- 
types  that  exist  are  dependent  on  the  specific  DBMS  for 
which  the  DDS  provides  support  services.  In  all  cases,  the 
entity-types  eguate  to  the  various  data  descriptions  used  by 
the  DEMS,  such  as  Schema,  Subschema,  Database  Directory, 
etc.  Relationship- types  and  attribute-types  are  provided  to 
allow  the  DDS  to  express  the  structure  of  these  entities. 

(5)  Other  Data  Entity- types.  Some  DDSs  offer 
Report,  Screen,  and  Form  entity-types.  In  each  case, 
relationship-types  are  provided  that  allow  the  contents  of 
such  entities  to  be  specified  in  terms  of  the  constituent 
element  s. 
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There  are  two  aost  common  Process  Er.t ity- 1 vpes. 
They  are  Program/Sodule  an!  System/Subsystem  as  will  be 
discussed  below. 

0)  Prooram/Mod ule.  This  entity-type  repre¬ 
sents  information  about  a  collection  of  executable  code. 
Typical  attribute-types  are  the  language  of  the  source  code, 
the  size,  and  the  characteri's tics  under  which  it  operates. 
Relationship-types  are  provided  to  other  Progr ams/Hod ules , 
as  well  as  the  data,  i.e.  databases,  files,  and  elements,  on 
which  it  operates.  Generally  speaking,  different 
relation ship- types  are  provided  for  input,  output,  and 
process irg- in- place. 

(2)  System/Subsystem.  This  entity-type 
describes  a  collection  of  programs  and/or  Modules  associated 
with  a  major  function  of  the  enterprise.  Relati onship- types 
are  provided  to  associate  a  System  with  Subsystems,  as  well 
as  the  constituent  Programs. 

c.  Usage  Entity-types 

Users  and  their  organizational  environment,  and 
the  data  communication  environment  can  be  thought  of  as 
Usage  (or  External)  Entity- types.  They  are  not  directly 
components  of  a  system,  such  as  data  and  processes,  but 
nevertheless  play  an  important  role  in  its  operation. 

The  User  and  organizational  component  entity- 
types  may  have  relationship-types  that  allow  users  tc  be 
associated  with  organizational  components.  These  components 
themselves,  and  selected  relationship- ty pes  that  associate 
users  or  components  with  data  and  process  entities  may 
describe  the  responsibilities  assigned  to  those  users. 

The  example  of  Data  Communication  environment 
entity-types  are  Terminals,  Messages,  and  other  entity-types 
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that  describe  the  communication  netwarks.  Their 
relationship-types  may  provide  the  associations  cf  such 
entities  with  other  usage  entities,  i.e.  a  given  terminal  is 
assigned  to  a  certain  set  of  people  or  a  certain  organisa¬ 
tional  unit.  Or  it  may  provide  such  associations  with  data 
and  process  entities,  i.e.  a  given  terminal  is  authorized 
to  execute  only  a  given  set  of  transaction  programs,  or  to 
access  only  certain  Files  or  Databases.  It  will  be  appro¬ 
priate  tc  note  here,  that  the  role  of  the  DDS  in  these 
matters  is  strictly  a  repository  for  documentation,  and  that 
the  CDS  by  itself  cannot  be  expected  to  enforce  such 
restrictions  and  limitations.  In  order  for  DDS  to  be  used 
for  enforcement,  appropriate  "active"  interfaces  would  have 
to  exist  to  assure  that  the  restrictions  and  controls  which 
are  documented  in  the  data  dictionary  are  always  invoked  at 
execution  time.  It  will,  on  the  other  hand,  create  more 
complexity  and  much  overhead. 

3 .  Extensibility  Facilities 

The  concept  of  extensibility  facilities  is  to  allow 
an  installation  to  modify  the  system-standard  schema  as 
delivered  by  the  DDS  vendor.  Any  new  schema  descriptor 
created  through  the  use  of  extensibility  facilities  will  be 
referred  tc  as  an  extensibility  descriptor. 

Extensibility  facilities  are  extremely  powerful,  and 
their  usage  should  be  done  with  great  care  because  exten¬ 
sions  to  the  system- standard  schema,  once  they  are  used,  can 
only  be  undone  with  some  difficulties.  Additionally, 
changes  to  the  system  may  create  confusion  among  the  users 
of  the  DCS  as  well,  and  decrease  their  confidence  in  the 
system.  Due  to  these  reasons,  it  is  recommended  that  their 
usage  should  be  restricted  to  the  Dictionary  Administrator, 
the  person  who  is  responsible  for  DDS  function,  i.e.,  the 
recording  of  all  meta-informations  and  meta-data  and  itz 


maintenance  through  the  use  of  the  DDS,  along  with  making 
its  facilities  availatle  to  the  users  of  the  system. 

There  are  three  kinds  cf  such  facilities  that  exist 
in  current  DDSs  : 

a.  Entity-type  extensibility:  the  ability  to  add  new 
entity-types  to  the  dictionary. 

b.  Attribute-type  extensibility:  new  attribute- types 
for  either  entity-types  or  relationship-types  car.  be 
declared  using  this  facility. 

c.  Relationship-types  extensibility:  this  facility 
allows  the  installation  to  declare  new  relationship-types. 

4 .  Status  Fac  il i ties 

These  facilities  allow  the  DDS  to  be  used  in  a 
System  Life  Cycle  environment  where,  for  instance,  a  certain 
entity  may  both  part  of  a  production  system  and  a  new  test 
system.  Due  to  its  intended  usage,  it  is  preferable  to 
maintain  the  same  name  for  the  entity  in  different  stages. 
Therefore,  a  facility  is  required  that  will  allow  two 
distinct  dictionary  entities  to  have  the  same  name,  yet 
different  attributes  and  relationships.  In  some  systems, 
such  entities  can  be  distinguished  by  assigning  different 
version  numbers  to  them.  In  systems  having  a  status 
facility,  it  can  be  accomplished  either  by: 

a.  Appending  the  entity-status  to  the  entity-name 
which  provides  the  uniqueness  of  the  name. 

t.  Logically  partitioning  the  dictionary  into  sepa¬ 
rate  databases  for  different  statuses,  and  requiring 
uniqueness  of  the  name  only  within  each  partition. 

5 .  Dictionary  Commands 

A  DDS  may  have  one  or  more  interfaces  that  allow  a 
user  to  interact  with  the  dictionary.  Such  an  interface  may 
in  the  fcrm  of  : 


it 


•  A  command  language. 

•  A  screen-oriented  interface. 

•  A  fixed  format  batch  data  entity  facility. 

•  A  programmatic  interface  that  allows  user  written 
application  programs  to  access  the  dictionary. 

A  screen-oriented  interface  is  more  user-friendly 
compared  to  the  others.  It  results  in  higher  utilization  of 
computer  system  resources,  but  makes  the  DDS  available  to  a 
larger  class  of  users.  Another  benefit  may  be  that  the 
error  rate  is  substantially  reduced. 

The  dictionary  commands  may  be  differentiated  into 
eight  categories  on  the  basis  of  their  functionality: 

a.  Dictionary  Maintenance  Commands:  this  enables  an 
installation  to  create  and  maintain  its  data  dictionary. 

t.  Peports  and  Queries  Commands:  an  installation 

may  generate  reports  using  meta-data  contained  within  the 
data  dictionary  using  these  commands. 

c.  Data  Structure  Interface  Commands:  enable  ether 
systems  to  use  the  descriptions  of  data  structures  contained 
within  the  dictionary. 

d.  Extensibility  Commands:  vehicle  to  exploit  the 
extensibility  facilities. 

e .  Status-related  Commands:  the  ability  to  distin¬ 
guish  entities  in  different  stages  of  the  life  cycle. 

f.  Security  Commands:  used  to  allow  security  decla¬ 
rations  to  be  assigned  to  the  dictionary. 

g.  Dictionary  Processing  Control  Commands:  used  to 
control  the  dictionary  process  such  as  lcgor.-logoff , 
processing  defaults,  etc. 

h.  Dictionary  Administrator  Commands :  exclusive 

commands  especially  designed  to  be  used  by  the  dictionary 
admi  r.istra  tor. 
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6. 


~£idoe  Facilities 

Cther  facilities  of  a  DDS  exist  in  the  form  of 
bridges  cr  interfaces  to  other  systems.  The  contents  of  the 
dictionary  may  be  made  available  as  part  of  the  processing 
functions  cf  that  system.  In  each  case,  these  systems 
provide  tools  whose  functionality  is  outside  the  DDS  hut 
which  require  data  about  entities  which  can  be  expected  tc 
exist  in  the  dictionary.  By  accessing  a  dictionary  and 
extracting  the  required  information,  the  disa dvanta ges  of 
having  tc  store  and  maintain  redundant  data  is  eliminated. 

Those  such  interfaces  are  : 

a.  Report  and  Query  System:  the  ability  to  support 
various  kinds  of  reports  and  queries. 

b.  Validation  Criteria:  support  other  systems  by 
providing  a  module  which  performs  the  specified  validation 
and  which  can  be  inserted  into  a  program. 

c.  Database  Design:  the  ability  to  provide  basic 
data  needed  by  automatic  database  designers. 

d.  Test  Data  Generation:  support  test  data  genera¬ 
tion  by  providing  descriptions  of  the  structures  and  formats 
of  the  files  and  databases. 

7 .  EDS  Security 

As  mentioned  before,  the  term  DDS  security  is 
applied  here  to  denote  the  security  of  the  DDS  itself. 
Entities  in  the  dictionary  may  have  attributes  describing 
access  characteristics  to  the  real-world  instances  cf  these 
entities,  but  this  data  is  entirely  informational  in  nature 
and  cannot  be  enforced  by  the  DDS,  since  the  system  is  not 
part  of  the  loop  in  the  execution  of  programs  against  the 
"real  data".  On  the  cther  hand,  unauthorized  access  tc  the 
DDS  can  be  eliminated  by  applying  such  security  procedures, 

e.g.,  the  assignment  of  passwords,  or  the  inclusion  of 
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security  levels  which  control  the  various  kinds  of  access  to 
dictionary  entities.  To  gain  more  integrity  and  relianiity, 
the  security  of  the  DCS  must  he  considered  to  be  related  to 
the  security  of  the  entire  computer  system.  The  ievel  of 
security  existing  in  the  computer  system  is  influenced  by 
the  security  of  the  basic  systems  software  and  the  physical 
security  of  the  installation,  as  well  as  the  procedures  used 
by  the  personnel  of  the  installation.  These  latter  are  often 
notably  lax,  and  it  is  not  at  ail  unusual  to  observe  cases 
where  passwords  are  net  kept  confidential  and  may,  indeed, 
openly  be  shared  with  unauthorized  people. 


D.  CCST/BENEFIT  ANAI5SIS  FOR  DOS 


As  applied  to  any  system  acquisition,  Cost/Benefit  anal¬ 
ysis  should  be  done  prior  to  and  in  order  to  get  a  justiii- 
cation  for  DDS  acquisition,  implementation,  and  usage.  Thu 
following  list  of  costs  and  savings  represent  tangible  items 
that  can  be  used  in  assessing  the  costs  and  oenefits  for  an 
economic  study  of  the  feasibility  of  implementing  a  DDS. 

1 .  Costs 

There  are  eight  possible  costs  which  may  be  consid¬ 
ered  : 

a.  Acquisition  cost  is  the  accumulation  of  lease  or 
purchase  cost  and  the  maintenance  cost  of  the  system. 

b.  Data  Administration  staff  cost  is  self  explana¬ 
tory. 

c.  Hardware  Cost  is  the  sum  of  storage  device  cost 
and  CPU  time  cost. 

d.  Start-up  cost  is  the  total  cost  of  training  data 
administrator  staff  and  all  activities  such  as  developing  a 
comprehensive  plan  (see  Table  I  as  an  example)  that  should 
be  done  in  any  data  dictionary  implementation. 
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TABLE  I 

Comprehensive  Plan  for  DDS's  Osage 


1.  Development  of  a  policy  for  the  use  of  the  EDS. 

2.  Development  of  standards  to  be  followed  in  the 
dictionary,  including  naming  conventions  for 
dictionary  entities. 

3.  Development  of  decisions  on  how  to  use  the  control 
facilities  of  the  DDS,  such  as  the  status  and 
security  facilities. 

4.  Delegation  of  authoritv  and  responsibiiitv  for  the 
use  of  various  DDS  facilities. 


o . 


Definition  of  procedures  for  the  use  of  the 
and  develocment  of  the  required  policies  to 
ment  these* procedures. 


DDS. 
i  mpl 


e 


i 

I 


6.  Design  and  implementation  of  customized  features  l 

for  the  DDS,  should  any  he  required.  This  may  { 

include  change  to  the  dictionary  schema  to  allcw  | 

raw  tvpes  of  information  resources  to  be  stored  j 

in  the  dictionary,  production  of  specialized  j 

reports,  or  interfaces  to  other  5/w  systems.  | 


e.  Data  collection  cost  is  a  function  of  the  number 
of  entities,  attributes,  and  relationships  which  are  to  be 
put  in  the  dictionary. 

f.  Haintenace  cost  will  depend  on  the  degree  or 
changes  to  the  application  system  or  systems  contrclieu  by 
the  DDS. 

g.  Application  system  change  cost  is  a  cost  perti¬ 
nent  to  any  change  to  the  application  systems  due  tc  the 
implementation  of  the  dictionary  for  reasons  of  efficiency, 
integrity,  and  maintainability. 

h.  User  education  cost  is  the  cost  for  training 
people  involved  in  data  dictionary  usage  in  addition  tc  the 
data  administrator  staff. 
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2.  I§£tor_s  for  Estimating  Savings  and  Benef  its 


There  are  four  factors  which  can  be  used  as  ar.  aid 
in  estimating  the  quantification  of  savings.  The  greater  the 
degree  to  which  these  four  factors  are  held  to  apply  tc  the 
enterprise  and  its  operations,  the  more  the  high  end  cf 
estimated  saving  and  benefits  can  be  expected.  Those  four 
factors  are: 

a.  The  Maturity  of  an  Information  Processing 
Environment 

This  is  a  major  factor  in  the  benefits  that  can 


le  attained  with  a  DDS. 


Increased  maturity  will  help 


substantially  in  the  integration  of  dictionary  facilities 
into  the  operations  cf  the  enterprise. 

h.  The  Complexity  of  the  Environment 

The  number  of  data  elements,  files,  databases, 
and  programs  can  be  used  to  measure  how  complex  the  informa¬ 
tion  processing  environment  is.  Problems  caused  bp 
complexity  tend  to  worsen  geometrically  with  the  number  of 
such  elements,  files,  databases,  and  programs.  The  value  of 
a  DDS  will  be  greater  as  complexity  increases. 

c.  The  Degree  of  Data  Sharing 

It  should  be  common  practice  that  data  elements 
are  shared  by  different  programs,  where  some  of  th^se  are 
from  di  f  ferent  s  yste  ms.  An  import  an  t  issue  is  that  char,  ges 
in  one  part  of  the  system  tend  to  have  effects  in  many  ether 
parts  of  the  system.  Failure  to  compensate  for  such  changes 
can  cause  production  failures  and  unanticipated  costs. 
Tracking  the  effect  of  these  changes  is  a  valuable  feature 
of  a  EDS.  It  is  made  possible  by  evaluating  the  attributes 
and  relationships  of  changed  entities  as  the  basis  for 
f art  her  tracking. 


d.  Personnel  Turnovers 

The  personnel  associated  with  information 
processing  systems  is  either  lata  processing  organization 
personnel  or  user's  organization  personnel  used  to  deal  wit!, 
the  information  processing  system.  In  this  regard,  a  DDS 
offers  two  advantages:  First,  information  which  otherwise 

mignt  be  stored  in  the  minds  of  individuals  and  which  may  he 
lost  to  the  organization  with  the  loss  of  the  individual  is 
now  placed  in  the  DDS .  Secondly,  the  learning  curve  for  new 
personnel  is  steeper  than  it  would  be  without  the  use  of  a 
DPS. 

3 .  Savings  and  Benefits 

There  are  five  areas  in  which  savings  and  ter.  ei  its 
may  be  expected.  The  four  factors  mentioned  in  the  previous 
section  can  be  used  as  aids  in  predicting  specific  monetary 
savings  in  each  of  these  following  areas: 

a.  System  design  and  development:  the  prime  advan¬ 

tage  of  the  DDS  is  in  its  use  for  better  comnur.icat ion 
between  users  and  implementors.  This  results  in  fewer 
changes  or  iterations  and  consequently  faster  progra mmir 
because  the  s peci f ication  is  better  documented  an 

understood  by  all  parties. 

b.  System  maintenance:  better  and  more  complete 
documentation  in  the  dictionary,  the  ability  to  analyze  the 
effect  of  proposed  changes,  and  the  improved  communication 
between  users  and  maintenance  programmers  on  proposed 
changes  or  corrections. 

c.  Data  redundancy:  reduction  of  unplanned  data 
redundancy  will  result  in  an  improved  system  which  has 
greater  integrity  and  better  operability  as  well  as 
potentially  decreased  requirements  for  random  storage 
devices . 
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IV.  INITIAL  DATA  DICTION AE I  DESIGN  FOE  DISPU LL AHTAD 


A.  GENEFAL 

As  mentioned  before,  the  initial  design  of  Data 
Dictionary  will  be  limited  to  four  application  areas: 
personnel,  payroll,  intelligence  personnel,  and  territorial 
personnel.  This  design  is  a  first  step  in  the  Stage 
Development  approach  used,  therefore  it  may  be  expanded  m 
the  future. 

E.  DATA  DICTIONARY  SCHEKA/S OBSCHEM A 

In  a  manner  analogous  to  the  context  of  a  DB.MS,  Data 
Dictionary  Schema  denotes  the  logical  structure  of  the  data 
dictionary  (DD) .  In  this  regard,  then,  the  term  Subschema 
will  denote  a  subset  of  the  schema  to  be  seen  tv  a  giver, 
application  (process)  or  user  [Ref.  6],  and  it  is  compatible 
with  application  views  of  a  database  [Ref.  7]. 

1 •  Eata  Dictionary  Schema 

The  structural  characteristics  of  the  LD  and  the 
contents  of  the  DD  schema  are  important  aspects  of  the  usage 
of  a  DDS ,  since  by  evaluating  these,  users  may  know  what 
kinds  of  me-ta-data  and  relationships  between  them  exist 
within  the  DD.  As  suggested  by  Lefkovits  et  al  [Ref.  8], 
the  structural  characteristics  of  a  DD  may  be  described  ir. 
logical  terms  in  order  to  gain  a  clearer  insight  of  what 
kinds  of  meta-data  are  supported  by  the  DDS.  The  logical 
structure  of  a  typical  DD  drawn  by  Allen  et  ai  [Ref.  5]  seem 
to  be  appropriate  for  DISPULLAHTA D’ s  DD  with  only  minor 
changes,  and  it  is  presented  as  figure  4.1. 
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DATA  DICTIONARY  SCHEMA 


Figure  4.1  Logical  Structure  of  DISP0L1 AHT AD ' s  DD 
(adapted  from  Allen  et  al  [Ref.  5]). 


There  are  three  kinds  cf  entities  (from  the  left  to 


he  right)  within  the  logical  CD  in  figure  4.1  : 


39 


a. 


Data  Entities 


These  consist  of  database,  subschema,  relation¬ 
ship,  file,  group  of  elements,  and  data  element  entities. 
Wherein  each  record  may  consists  of  some  elements,  tut  may 
or  may  not  have  group  cf  elements  in  it.  File  and/or  record 
entity  is  the  subject  of  a  process  entity,  while  data 
element  entity  is  the  subject  of  a  transaction  or  a  report 
entity. 

t.  Process  Entities 

A  process  entity  may  be  an  application  program, 
a  program  module,  or  a  system/subsystem  that  typically 
generates  a  report  or  does  a  transaction  involving  either 
data  element  or  a  grcup  of  data  elements. 

c.  Usage  Entities 

Included  in  this  category  are  user  and  their 
organizational  environment  (such  as  processors  and  termi¬ 
nals)  ,  and  data  communication  environment  (such  as 
communication  network,  communication  nodes,  messages,  etc.). 

2 •  lata  Dictionary  Subschemas 

Since  there  are  four  applications  included  in  this 
initial  design,  there  might  be  four  subschemas  accordingly. 
Due  to  the  relatively  small  amount  of  meta-data  that  will  be 
stored  in  the  initial  dictionary,  however,  and  in  order  to 
reduce  complexity,  it  is  better  not  to  apply  subschemas  at 
this  point.  Later,  if  the  dictionary  has  grown  substan¬ 
tially,  and  many  applications  have  been  added,  the 
subschemas  may  be  applied  in  order  to  improve  efficiency  and 
security. 


c. 


D ISPULIAHT A  D * S  DATA  DICTIONARY 


The  design  of  this  DC  is  based  on  current  applications 
for  which  the  DBMS  has  not  been  implemented  yet.  The 
following  tables  present  the  lists  of  Data,  Process,  and 
Usage  entities  and  one  or  two  of  the  actual  instances  of 
DISPUIL AHTAD’s  data  dictionary. 

1 .  Entities 

a.  Data  Entities 

Table  II  summarizes  data  entities  abstracted 
from  the  personnel  application  [Ref.  9  and  Ref.  10].  Table 
III  presents  data  entities  for  the  payroll  application  [?ef. 
11  and  Fef.  12].  Data  entities  used  in  the  intelligence 
personnel  application  is  presented  as  Table  IV  [Ref.  13  and 
Ref.  141.  Finally,  data  entities  from  the  territorial 
personnel  application  are  shown  in  Table  V  [Ref.  15]. 

b.  Process  Entities 

Process  entities  belonging  to  the  four  applica¬ 
tions  are  presented  as  Table  VI  .  This  table  lists  only 


routine  processes,  not  irregular  processes,  since  the  latter 
ones  are  not  yet  standardized  (they  are  done  as  requested). 


c.  Usage  Entities 


Table  VII  presents  user  entities  for  the  four 
appl  ications. 


TABLE  II 

Personnel  Application  Data  Entities 


V 

o 

•H 

G 

... 

> 

O 

o 

(-1 

O' 

■H 

1 

OJ 

P  a> 

C 

U3P 

1 

-c 

•a  >~i  in 

•H 

3  3 

1 

u 

a  p 

4- 

P  O 

1 

T3 

4-1 

S3  33  0  O.0) 

G  P 

0>  '4) 

3  3 

1 

Ui 

033  3  CnP,C 

P3J 

1 

a; 

O 

oi  a; 

43 g  aj+J v  cu  op 

to  a  gp 

COWP  C/i 

M 

1 

H 

U 

c/3  m 

.-1  3  >*H  3P  O'  _c 

3  0  0  a 

P  3 

"J 

1 

•H 

<v 

3 

<t3  3— t  GP  3  <K  C7P 

P.--IH  3 

-H— 1  4+J 

M  ^ 

G— .  | 

in 

Pa 

C/3  S3 

■a  bpgcocjp  o  u 

3  cups 

3  3  *r4  3 

o>  a  ^ 

O  1 

3 

O  O  its  -1-A-A 

4J  B  U 

aacjaopjjous  r^p 

•H-G  ( 

1 — 1 

cd 

.HP 

01 CJ  P  (/Jr-C-tU  gcq 

ton  GLi  O  O  O  41  co'cibo/s  so 

P  C/3  1 

0) 

+J 

cua> 

CU  p  U  0)  0)  -A 

D  Q)  U-H'H  CUJ 

0  3ja  — vh 

O.H  1 

g 

03 

e 

cop  m-A  a  CH4  04-1 

— 14-|  CJ3)&upP  D  O  OOS  BP  0 

•HrH  | 

a 

Q 

.  G 

VO-H+J  SCOOO 

AOU4 

3  3CJ3  G 

3M4JVU 

U  CT1| 

o 

3  0 

C/3  G  C/3  0  O 

■A  OOMGG  S3-H  0  MIS-HG  3 

o  a  i 

i/i 

G 

.  cn.x  cup-h-h  tn  c/3  a>  <u  a> 

•H  O 

03334-.CJC/3U3O  ,G  CT'CT'C 

c/3  to  i 

u 

•H 

CO  p  a  P-H  12  DM-l  PPPP  X  Pp  1>  (0*0  U03K330  COO-H-H— 1 

01  1 

U> 

03 

.  <1)  (8  0  C35  OOMUitdiflaim  3-CP  P  U  U  3r-4  O  O-G  3  1  a;  0>A! 

g — ) 

a. 

3: 

waKUC«:i-ini04ueQina«^HOOO^C4  3. 

M 

•-H 

X3-G 

1 

1 

332 

1 

n  3 

rH  p  | 

1 

•H 

C 

M 

3  3  IH  1 

C/3 

3 

e  0 

cn  p  -a  I 

1 

u 

4J 

3H 

m  hjhg  l 

1 

a  G 

0) 

•H  3 

i—4 

C  C/3  cucuo  _  I 

1 

3  3 

3  Hi 

3  B 

■A  3  0)  O  1  CT< 

— *  1 

. — t. — 1  CCHG. 

4J-H 

»  3 

3)  3 

HGCOE-CH  s 

1 

•H-H  G  1  C/3 

3  t-i  G  GJ 

-X 

B 

'  H 

G  1 

•H 

G  G  O  C33 

^  O 

■3  O^H*H  G 

3'H 

CCS  UGQ  C  G  G  G  ! 

G  3  1 

C 

O  OH  G  CU 

U  DIG  B-H.X 

rH  44 

fd 

A  3  3  3  3  3  GP  | 

O-H  1 

O 

-X 

03  CO  O  3.X 

G  M 

O  a.  3  3  2  3 

G  3 

4-*  G. 

S  B-G  P  U  U3)  3-H  1 

•a  cn  1 

U) 

0 

H  uoaa  3-H  3  O 

'T.'s.J'H  3^4 

•HQ 

fda3dd'0  33d(0»OH  | 

4->  Q3  1 

u 

-X 

3lll\3Hfl3aiHOH  OSC 

C4 

(d  g  1 

cue  1 

<u 

0 

cua4  4JM-w 

l_|4->Q5r-IS  -H 

CUSX  a  CT>H 

3333  COS 

A  O  | 

cu 

Ol, 

3\3  G  3  C/3  O  3CQ  3  C/3  U 

3  3  03)44  0302  -A 

UT3  1 

Ul  X(naHS3  7G«!3Cll3ft 

UH+»*Hit)a  MMUTOH)  1 

U  G  | 

<v 

3 

O  3  D1CU3  B  34J  O 

C7>H4J 

O  G  3  CTO  CUP  Q)  O  O  O  CT>3  U  1 

C/3  W  1 

•■H 

4-> 

BBcU+/4J*'3Vf-lHCfliflH 

B-H-Q 

G  G  B  33)  B  a  B  G  U  U  1 

01  1 

•H 

3 

03  3  00304J3S:2S304JSS 

O  M  ^  0>4->  O  O  O  O-rH  V  \ 

O'—  1 

Q 

ZZGJXinKCOXt<t4B3COH 

I  W 
I 

>1  I  M  to  <  IOHMH  ca  Z  W  03  OSSCS-GS  HH 

p  i  ix.  y.  k  s  to  sc  pi  os  ui  ca  « us  cc  s  z  *e  HKHfflsiHDHt' 

•H  0)  i  to  O  w«coft«WUOomxHH'o 

-4-1  a  i  os  ca  OiEzusHsxruOU'G.-iacc’ai  atejasrjsxj^ascucusnasasras 

C  it)  I  W  «*  QoaCOOSSOt-'E-iCJCJJtSQHO  0«<Hhh1H0W0MHW« 

OJS  I  Uj  a  ZZU)U«U*l'IMtHtrit^SCWe'  2U.'-3(OCJt-,IOJ£.yit-'t-|tHUU3 


I 

>1  I 
4-  I 
•A  0/  I 
p  ap 
C  >.  I 
uie  i 


a; 

H 

in 


ppppp-l_>ppppppppp 

35  g  c  c  g  c  c  a  e  c  c  c  c  c  a  c 
U  0)<VO)0)0/0><UO)0)0/0)0)0)0)d> 
O  a  a  a  a  a  a  e  a  a  a  a  s  a  a  a 

U  O/O/QJO/O/OiOlOJC/cVO/iUO/OJCD 
«  WWfeWKWWWWKWKWWU 


GGCGCGCGCCGGCC 

aaaaBaeeeaaasa 

a)a>o>a>Q/o>a)0><Da)a>c/<D:i) 


KWHWKWWWWUIWWWW 


I  . .  ......  . . .  • 

i  ^-rjoo-s’LT^r'CCCT'Or-tNfo^'io  ^or-ccc'O.-r.iro^nrKor^cocs 

|  •  t-»-t-.-(N  <N<N<NCNCNCNCNCN<N 

•  |  < 

O  I 

S  I  M 


Table  II 

Application  Data  Entities  (cont*d 


l 


— 

p 

o 

OH  31 

•H 

O  .3 

P 

G 

c 

e 

CH 

O 

O  CS 

T>  3 

u  ic 

•P 

•H  O 

G  3 

■H  35 

01  3  P 

4->H 

•P 

05 

PUCIO 

d4-> 

G  3  P  G 

G  C/1  p 

•H 

0*  O  O  P 

o  <d 

r3 

O  G  31  0)  O 

oi 

O  t-(  ra 

05  00 

pup-pp 

S  O  >i 

U 

■P  P  P  3p 

O' 

■h  cm 

05  01 

J3-HP  0) 

•  c-TO-H-P  >i 

O 

P  31  T<  U  Pp 

V  fO 

P  (D  fH 

fl  31  P  05  ’J'fl  3-H 

3 rH  P-4 ‘H-H.p 

o 

3P  31  CO  3  3 

cna 

O  U» 

H  S  11  C  3  =  K  a 

•  O  *r|r—H-H 

0) 

pOcr.  o  PP 

3  o> 

a  0)  o 

CJPP0  3  -rH  XI  IH  .  05-HP 

OS 

3  U  HH  OOC 

s  a 

oouu 

•h  aeon  entrap 

•  >P  01H3-P 

O  3>P  3  31 

ht>3 

p*h  >-h 

Pa  3 

3  GP  3  C/5  3-P  3«£-Q 

c 

o  ghs  p  a  s  g  n 

Shi 

Oi'MOl  > 

0)  3 

ChS-H  < 

o 

OOP  MO  O  O  31 

<0  a 

Ml  u 

01M 

oo>> 

G 

•H 

•P  3-P  PPP 

a  a  owo  g  m 

P  iSH 

«d4-|-P  rc < — < 

-P 

ppp  opmp  a. 

05  who  ocn 

o  uh  p  oou  o  o-h  o  oi -c  o  n 

fd 

03303  33 

•ppp-H  Ci  G-H 

U1.G-H  O  0)  3) 

rH  SD  -P*H*H 

a, 

M  c  Mo  mm 

UOlflld  CJ-ri  01  3Pp 

XL  O  'H  HflSHH+JPHP-ilH  05  O 

rd 

31  3U  a  31  3 

•HOtJUmn  uo 

•H  SP-H-H-H  >-P  3V3  M01 

O 

P  O  G  O  03)  OP 

3P  ai  o  o  a)  3  c/}*h  ro 

P  0)-C  C  3  3  S  S  C  G-H  GP  3>X  O, 

o 

3  O  3P  O  p  O  O 

JCaSPaOSQ  3aa 

H  a  a  a 

ZZ^DDDUD^EOjCO 

Q 

QOKPOQCO 

i 


i 


C 

M 

• 

3 

GP 

-B 

p 

3 

3  05 

«  G 

•P  G 

3 

•P 

P  G  01 

3  O 

P  P3 

n 

P 

•P  3  P 

3  05 

3  3\p 

c 

3) 

P  p  O 

a.  pc 

P  G  3 

a 

a  3 

05 

M  3  J5 

a  Eli  0)  3 

3HS 

•d 

3  P 

31 

■P  P  O 

•P  3)  3  O.  05 

•Mip  a;  05 

3 

4J 

ip  3 

u; 

(O  O)  3)  >i 

hc  pm  e  3 

P  p  05UJ  3 

P 

•d 

3P 

p 

G  CUC  03 

3  31  P  3  P  O' 

3  01  Ql-P  05 

M 

“1 

Q  3 

3  3-G 

3  3  3  CH 

pa  o  05-p  3 

oipu;  05  3 

d 

*3 

PCCD 

&.G  05  G 

•HP 

GP 

G 

3  3  P  a  3 

cn  -PC 

H<j»;  33  o  3) ps  p  p.pau; 

G  3  3 

3  G 

QOQl-H  P 

05  3  e  3  0)  3 

cc5<«aa3cii(/i 

3  P  3 

Oh 

P  3 

G  3  01  -P 

fl-n3  0133ACfl  T3X3-U  (T3  a  a 

P  301 

01-P  3  3 

C®Q<  * 

G  PP  DVnl+JftCrtl  C  G  G  G  G  3-Pol  3  3 

3-3 

p;  pp;  g 

3H  P 

•H  G*H  Ofi 

3  3  3  3HHH-H  3  3 

G 

XI  3P 

C/5  3  tr>3 

PW  3  3  01 

fdU-4  (d 

P-G  3  3  3  3 

•pp  aa 

<d 

3W5  3 

Q  G  05 

3  G  3  05  05  3  CO 

pp.p  a  p  3  3pppp  p  osi-p  a  a 

4J 

r>  Pi 

u 

rdj< 

-Q  O  >i3  3  a 

3  3  0)  =»  05  3  <UMP  3  <3  3  3  O  G  05  3  3 

01  C7) 

oa  o>3 

SMPPP  3H 

05  05  ou  30)aeao3  03  03  03a  coi  3  a  a 

-Q 

H-O  G 

aw 

G'H 

3  0  3  3  3  USE 

3  3-p  3H  3)0330)0013)  O-P-P-H  3)  01 

<d 

S  O  3 

O^: 

«O03CQcnrt;ei 

EEQ1 liCKZniiC^SC^KEOSGCc; 

Muia 

ZCOOjCm 

a; 

a 

a 

o 

05 

p 

u 1 
CL, 


E>ra<«s:c/i*c«;  mu; 
(Q«s;MCi«:a:cn  aw 
SKEHXSKIti  aa 
ou: 

uBPOOUHPoSM  SEE 


<  mmin 

cmeh  as»  in  JUHB 
pa«coca«:aOMu:ac/5 
DtfKaic^wcnuoazrinaa 

*e  < 

Di-lwaaz<«;<«<oHHi-iDa 

u5u:</iz;«j:«:i/iciii/ic/5^Ewu;aa 


P  P  p  p  p  p  P  PP 

G  C  G  G  G  G  C  G  G 

0)  0)  0)  0)  01  01  01  01  01 

e  a  a  a  s  a  e  aa 

(D  (D  5)  5)  0)  J)  0)  5)0) 

hhhhhhh  hh 
WWWWWWW  ww 


CCCCGCCCCCCGCGCC 

oioioioioioioioioioioioioioioioi 

aaeaasBaaaaaaaaa 

oioiaioiojoiojaioiaiaioioioioioi 

a  w  a  a  a  wwwwwwwt;t5ww 


0»~<Nm  G-lXvC  MOO 

mmmmmmm  mm 


o>o<— (NroG’UH'xMcocwo*— csima- 
m  g-  3-  c  3  a-  a-  a-  a-  -a-  3  inmin  mu  i 


25 

CQ 

cqcq  cn 

CQQrcr; 

■<<c,nL*z 

< 

BD«n 

m 

*< 

OO^ 

ODHrt 

axnp 

H3 

ppp 
c  c  c 

pppp 
G  G  G  G 

p 

01  01  01 

0)  01  01  0) 

0 

a  a  a 

a  a  a  a 

o 

01  01  01 

01  01  01  01 

01 

pi — ip 

Hi  Ii  ip 

a 

WWti 

WKWP 

*  •  » 

r-c-jn 

t  •  •  « 

G-iX-vOM 

i 

I 


43 


Table  II 

Application  Data  Entities  (cont’d 


• 

x 

ra 

o 

<4-4 

V 

u 

01 

O 

x 

X 

T9 

o 

-X 

CL 

0) 

0 

X 

o 

o 

X) 

•X 

a 

o 

X 

e 

>1 

G 

E 

44 

T) 

s 

cc 

o 

•o 

X 

•H 

D 

a, 

u 

X  o 

X-X  X 

•X 

O 

c  c 

S 

•r4 

o 

O  X 

a 

cox  a> 

dd 

—  X 

X 

o  c 

X  (/) 

o 

■he,  x 

o 

X  o-x  3)40 

X 

X  X  X  O  -X 

V) 

X 

O  3 

0) 

4C44  0 

•H 

o  -h+j  use 

o 

o  o  o-x  r*4 

■X 

4-i 

V 

t/)4-» 

a; 

X  O  X-X 

4-» 

•H  44(933)3 

o 

•x-x-xx  w 

33 

<T3 

03 

oi  id  c 

3  H  0)44 

(T3 

4  9(9  U332 

a ) 

444444  3  i)  x 

cun 

X 

ox  o 

>4 

05  033  O 

u 

333  U3B3 

K 

3  3  3  (433  O 

, — 1 

o 

to-x 

14 

x  x  a 

a 

U  0  33  X  <0 

(-4  X  X  0)  X  -X 

3 

O  Q> 

*  44 

o 

<ua,o  o 

SUdWUl/l  3 

X 

0)  <B  0)  OO  X 

G 

O-c  e  a  a  c  e  3 

T> 

-X  X 

cn 

3*  W  © 

3 

0,0,00  a  3 

O 

ox  o  o  o  o  o  V 

<11 

0)4444  ecu 

WC  X  0433 

X 

000  xox 

-X 

X-X  X'X-X  C 

44 

33  O  O 

004  oxoo 

o 

X  O  U  0) 

•  44 

'44  X  44  44  W  X  44  1) 

3 

004  -H'H 

u 

04. H  o  0  X 

-X 

0,14-4144  0-3(4  CU 

33  (0 

O  <d  «  id 

id  -o  3 

u 

U  O  XX  O 

<T5 

OX  X-X  X  3 

X 

000  44  0 

*  o. 

0,0,14  0,0, y 

\ 

0)  o 

4-> 

3  9HX O  3 

3 

•1)  3  X 

x  n 

0).V  3393  3X 

-* 

4*  4)43  S  33 

•H 

9UU3  3X133 

X 

0)33  0)  0  14  o)  a; 

a  o 

4->  G  0  Otj  O  y  Q- 

X 

X44  SOX 

rH 

X  X  3  l/l  X  e  3 

0) 

X  C  a  3  0)33  X 

O  O 

idfdCOUOUB 

3 

3  3  3  X-X 

•H 

3 33 r-4  933X 

cu 

3-X  3-H  O.X  3 

<J  o 

H«OOOOOH 

W 

05  055  £X|W 

n 

«WC,s;OKO 

o 

O^ZCjQOQ 

W) 

• 

X 

3 

•H 

0) 

X) 

4 4 

4^ 

00 

•X 

a) 

3 

•X 

•X  3 

X 

n 

4^ 

33 

to  X 

•X 

c 

cr> 

-X 

3  41 

X 

G 

a  id 

44  X 

u 

33 

X  0, 

a 

3-0 

3-H  3 

0) 

x  a 

0)  c 

w 

3 

4J 

r-4  3 

X 

Xi  XCu 

4J 

<D  3  X  W 

0, 

44 

340 

3 

CD3 

•H 

O,  X4 <.  3  X  3 

O  -X 

3  c 

r-i 

-Q 

CO  3 

Cn 

XQ  X 

fH 

3-X43  3  X 

■X  3 

X  (0 

n 

X 

3  3 

■H 

-HX3-HJ<  3 

-X-X  C/l-X  to 

3  4-> 

44 

a 

o 

44  0,4444 

r: 

3-X-H33-X  3W 

3  (O-x  3  (0  0) 

Q  3 

a  <d 

3  G 

rH 

344  3-x 

W3333-X33  3 

X  3  (0  X  3  X 

XI 

(d  3 

a. 

44  3 

o 

44  3  a**  3 

*-* 

3-X  X33-X  U)X 

3X30)  X-X  0) 

C  3 

xtn 

3  3 

o 

trxs  0)  C*C 

«T3 

3333  0)  X3J-X  X 

sc  0)  x  o,u)  on 

o  n 

<d 

M 

■x  G 

N 

X  CO*  X  4) 

CO,  0)  319  X 

0,0)0  0,3 

•X 

-OX 

to  <d  a  tr>3 

44 

3  GCO  3S0 

•H 

x  a)  a,  d  e> 

•X 

XO  0,  OQiX 

X  C 

3  id 

3 

O,  3  Q, 

T3 

3Q,x  ru-x 

3  O-X  3 

3  3 

r)j^ 

x 

44  3-y; 

j* 

a,  x  g 

l«H  3  X 

3 

0>B  (0  C  C  0> 

O  -X 

cr> 

0  0,11)  ovd 

<r> 

<V  0  0,3 

'O 

erXD  a,H  AH  O 

X 

03>3  3  3-X-X  03 

>H  3 

f4  C 

aw-O  qr-4 

c 

3)t4  a  0)  u 

G 

X3J  awaas 

O' 

G  U  S4i  X  X  C 

iH  X 

w  3 

o^idani) 

3 

OSC  044  3 

>D 

(90919330 

Q, 

3  3  3  0  0,0,3 

Q,  a 

Ehcu 

SStOOCUCU 

a, 

WfcirBtOS: 

a 

E-iCHSMOZ 

o 

HStCSWtrnoe  1 

X 

0) 

33 

033  3 

Eh 

M 

a 

CQ 

=3Cq 

CQC005DCQ 

f-(  Eh 

M 

WU9WC0 

to 

WWtOtHC^ 

G 

*J9«l£HCq«) 

xC 

h«;«:eh 

r 

W WHMMWH 

c 

to  to  0, 0. 0,  «s  a. 

O 

n 

«933 

O'O-alrtE-i 

a 

HHOCQtlD 

« 

a,  0,00010c 

tn 

33^ 

tc&imow 

<«o,kk; 

M 

DQiCrlRrlK 

w 

ooze  wa,cuw 

X 

CG 

09<! 

O  Q«:f4«s! 

!Kt3WQ!C 

Q 

uooto<s:o 

04 

0«Crt)0  W  Wfr! 

4) 

cu 

n> 

txuu 

iftscooix 

U4 

UtlXKS 

CG 

XKHKXOZ 

u 

X  *3  rK  X  W  !*-.  t/i 

"■J 

V-P 
G  G 

X  X44  X  X 
C  G  C  G  e 

T3 

xxx  xx 

G  e  G  C  G 

'O 

xxxxxxx 

G  G  G  C  G  G  G 

T? 

+J  4^  4->  4->  -fJ 

C  G  G  G  ^  G  C 

X 

4)  4) 

0)  Q) 

u 

'U  <D  <i>  <D 

X 

Q)  <D  0)  (1)  Q)  Q)  0) 

X 

0)  4)  0)  4)  4)  4)  X 

0 

s  a 

a  a  a  a  a 

0 

a  a  a  a  a 

0 

a  a  a  a  a  a  e 

0 

a  a  a  a  g  a  e 

0 

4)  0) 

QQQ)0)0) 

0 

J)  O  9)  O  1' 

u 

0)  Q)  0)  0)  Q)  0)  0) 

u 

'D  0)  0)  1)  Q)  Q 

X 

fHrH*HrHi— f 

0) 

4) 

HHHHHf  H1  < 

4) 

t  4r-4r 

cc 

ww 

wwwww 

CG 

WWWWW 

UG 

WWWW WWW 

a. 

w  W  P 1  w  w  w  w 

t 

u 

•  • 

rn^t  ir.\or^ 

* 

Q 

#  •  •  •  # 

r-rjrn  rrm 

• 

w 

r-rNm 

• 

Mx 

•—  rvjroa’  Lr,vnf^ 

Table  II 

Application  Data  Entities  (cont’d 


o 


w 

■  d 

T-J 

o 

>4 

u 

4- 

U 

o 

u 

o 

d 

o 

a» 

u 

u 

u 

a. 

G 

a, 

a) 

3 

G 

3 

•H 

a 

or. 

"d 

{X 

Q 

O 

o 

c«j  a; 

cn 

U 

04  U 

cn 

cn 

•H 

>. 

04  cn 

w 

(/) 

G 

d 

•H 

4-> 

i-4 

vi  end 

0) 

a 

0/ 

3  G 

-G 

d 

o 

-g 

d 

•H 

•  d-d 

cp 

u 

■iH 

4J  cn 

-G-G  O 

>o 

4-J 

Cm 

c 

d-d  U 

u 

U 

l_,-d  4-1  U 

T3 

o  o  d 

04 

u 

-W  4-> 

u 

d 

a)  d  u 

T3 

a> 

d  04  >43  3 

d 

d  d  a- 

E 

o 

<3  U 

•H 

o 

ja  u  d 

"3 

*c 

zx 

C-d  44  Gi  4-> 

O 

04  0)  d 

u 

— *-H 

CQ 

C  O 

s  «r 

«< 

X 

0)  O  3  1) 

u 

d  dra 

a 

U4 

<PCQ 

oo 

'+4 

WE 

U-J 

QWQ44G 

01 

QCQ 

o 

03 

05 

4-J 

•H  G 

0 

C  '4-1 

U 

1> 

o 

03 

'4-4 

W-d 

0 

■U'-W  O 

'M  0  1/1 

o 

G 

c 

U-IU-IU-I  '4-1 

*4-4*44  O 

S+J 

>1 

bxO 

nj  O-H 

rH 

>40  t/I 

G 

Cn 

O  O  Or4  O 

_G 

o  o 

•d  (0 

rd 

'H 

'P 

G-i  *m 

•H 

H  04  04 

0/ 

O 

•H 

•d 

o 

0) 

+->  d 

■H 

•h  a)  a>  o 

g  p-u 

d 

■d  0)  O  d 

-G 

C/3 

0) 

a  oir-i  d  0) 

d 

14T3  O 

O  0) 

a 

e-*j  s  rd  x 

4-> 

S+4  fld 

o 

U 

U 

4-440  G  4-4  4-> 

04 

44  G  d 

•H  CM 

d 

d  (0  fOH  4HJfl0 

CP 

d  d-Ht: 

d 

V 

o 

d  O-d  01  d 

d 

d-d-d 

>•0 

o-/ 

U-,Q53CV4lnOa« 

Q 

id  w  Oi «: 

U-J 

IX 

Ud 

QU1C.QP 

id 

tdlG  U4 

vj 

G 

G 

4-> 

d 

td 

d 

o> 

cn 

G 

-u 

G 

•H 

M 

cn 

CTi 

d 

U 

to 

u 

d 

d 

•r-> 

(P 

G 

<d 

C/2  d 

cn 

cn 

G 

cn 

G 

(d»H 

G 

G  b^ 

u 

1-4 

CQ 

CP 

44  ,a 

G 

m<P 

<-)  d-d  G 

d 

d 

G 

d  3 

d 

use 

oi  mu)  a 

G 

3 

U 

4^-m  d 

d  G 

G 

ss  ij+jx  d 

fH 

rd  Q-/ 

•H 

d 

Cn3  /-4 

G  d  d 

3  u  <TJ 

dt/1  d  O' 

<P 

<U  G  d: 

C3 

G 

CM  d 

d  i"d 

H-d  end  =) 

H  U 

be 

U3-d  G  d 

O 

*G 

d  urn 

■d  U ' i_J 

04-G  d-dr-lN*  l-t  d 

»-d  O' 

y) 

uiuwifla 

G 

-O  G  d 

(0 

ig  <d  d-c  <y 

0!  3 

*-« 

c  d  *  G 

u 

G 

q;  t-i  d  enw 

d 

d  d"-n 

10 

►d  3  ds*s  ucc-d 

d 

d*i  d  04 

0) 

d 

cq  d  en3t<; 

d 

'PH  C4 

d 

d 

a  — « »-4 

d  0> 

cn 

-*  i£rJ 

a. 

a 

end  Eh 

d 

04  04  SC 

n 

cn 

d-H  0) 

G  drdtG 

G 

•dr-1 

d 

.H  01  Fh  rd 

m 

hGCU 

c 

d 

cndtG44.d-dd 

d 

rrj  d  4-4  4-4 

-p 

4-> 

r-f 

dZ  G  d 

O' 

4-4 

d  d 

d 

a  m 

dBM  end 

u 

■d  end  d 

d 

d 

d 

O'  a  d  o' 

G 

3  3d 

-0-3 

3 

3  end  a,d  <d  cne 

<P 

dj  cno,s 

e 

s 

cn 

CnO)  d-d  O' 

d 

44  d  04 

d  g 

rd 

■a  a  a  e-n^;  c  d 

4-> 

g  G  a  d 

d 

d 

G 

G-O  O  d  G 

■ — i 

j<;  O  a 

O  d 

04 

3dd04  04  04  d’-1' 

0) 

w  d  0>  >h 

-H 

r-H 

V 

d  o  d  i-4  d 

04 

d  d  04 

IGH 

>3 

KE-'Ze^GiE-’C 

be 

CUE-lE-i«< 

< 

cu 

EHStJaceoE-i 

04 

an  H 

04 

«c 

rd 

G 

Z 

0 

►JS«3  lGU:«a:  bC 

w 

rd  a  Eh 

E-* 

memo  co 

z 

C 

«e>G 

cc 

CO  C3  SG  CG  a  CH  <  «c 

be 

Ka«s< 

in 

rd 

< 

03C3  03 

o 

coco 

-G 

tGCCrsISCMOIEHSC 

Qd 

IGrebSn 

PI 

e> 

CK-dOrdOi 

05 

«3«c«e 

t/J 

w< 

JO 

cQHdSi-dsctGvi-a: 

H 

IGbGbdrS 

05 

<c 

25 

CQClDniG 

UUO 

d 

O»o 

►d 

G>  0*8  04  4-1010  0 

kM 

MOOrd 

Q 

rd 

w 

UWtdHO) 

0 

octx 

04 

CM 

be 

X 

ut^ne 

•G 

C 

fSS^CDt-' 

Ui 

4J4J 

4J  44  4-4  44  44  44  4J  4-4 

4J 

4-> 

44  44  44  4J  44 

4J4J4J 

G  G 

gggggggg 

G  G  G  C 

T3 

G 

G  C  C  C  G 

»r 

G  C  G 

04  01 

d 

cpcpcp^'pcpcpcp 

d 

Q)  04  0)  0) 

d 

0» 

d 

0)  (p  CP  <p  CP 

d 

04  O  O 

a  a 

0 

asaaeasa 

0 

a  a  a  s 

O 

a 

O 

a  a  a  a  a 

O 

a  a  a 

0)  04 

O 

DU(DQ)(P(D(U^ 

0 

04  0)  04  0) 

U 

04 

O 

0)  <P  cp  (P  0) 

O 

04  04  0) 

HrH 

04 

r  ^>Hr  <»  IrHrHrU 

04 

rHrHrHrH 

0) 

rH 

O 

0 

rHiHfM 

WW 

U-i 

WWKWWHUd 

tx 

PdfdW  W 

Pd 

w 

or. 

wwwww 

C3 

UJWO-i 

•  • 

•  •  t  • 

• 

•  •  « 

cccn 

r”CNrr)3-ir'jDr-cr! 

-(Ndd 

nm  cj-un 

• 

C 

• 

3C 

9 

H 

• 

n 

• 

be 

45 


Table  II 

Personnel  Application  Data  Entities  (cont'd 
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TABLE  IV 

Intelligence  Personnel  Application  Data  Entities 
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2. 


Relation  shi ps 


The  relationships  can  be  represented  Ly  the 
following  relations: 

RELATIONSHIPS (relation  name , key , composite  key, secondar y_key , 

otner_entity  names) 

- key - 

a.  Relation  Fame 

In  pre-DFHS  situation,  this  may  be  filled  with 
the  record  entity  name  to  represent  the  relationships 
between  element  data  entities.  In  the  case  where  a  D3HS  has 
been  implemented,  it  should  be  filled  with  the  relation  name 
since  a  logical  record  may  consist  of  some  relations  in 
order  to  reduce  complexity  and/or  fulfill  the  five  normal 
forms . 


b.  Key 

This  is  ar  entity  name  that  is  used  as  the  key 
for  both  storing  and  accessing  the  relation.  If  the  relation 
uses  composite  keys,  this  will  be  the  first  part  of  the 
composite  keys  where  the  second  part  will  be  stored  as 
composite-key  attribute. 

c.  Composite  Key 

This  is  the  entity  name  used  as  the  second  part 
of  a  composite  key.  If  primary  key  is  not  composite,  this 
attribute  will  be  filled  with  "NONE". 

d.  Secondary  Key 

This  is  the  entity  name  used  as  the  secondary 
key.  If  the  relation  has  no  secondary  key, 
will  he  filled  with  "NONE". 


this  attribute 


e.  Other  Entity  Names 


The  names  of  other  entities  (except  keys)  in  the 
relation  will  be  filled  by  this  attribute.  An  ampersand  sign 
(  S  )  will  be  used  between  two  entity  names  and  a  sentence 
of  "REPETITIONS  OF"  will  be  written  in  front  of  a  repeating 
group  of  elements. 

3 .  Attrib  utes 

Every  entity  has  information  attached  to  it  called 
attributes.  In  the  following/  all  information  that  may  be 
included  as  attributes  will  be  discussed. 

a.  Data  Entities 

There  will  be  several  pieces  of  information 
included  for  each  data  entity  (either  file,  record,  or 
element  entity)  .  Since  the  DC  typically  be  implemented  as 
one  of  the  DBMS  application,  these  can  be  represented  as  a 
relation  of: 

FILE  ENTITY  (entity  name, block  size, access  method, logical 

record  size, physical_storage  device) 

- key - 

RECORD_ENTITY (entity  name, length, fixed  variable  code, key, 

composite  key, secondary_key. 
updating  ^ime, updating  mode) 

- key - 

ELEN ENT_ ENTITY (entity  name,lengthfcode, source, user, 

definition) 

- key - 

( 1 )  Entity  Name . 

The  name  of  the  entity  is  limited  to  a 
maximum  cf  eight  characters,  this  may  be  a  combination  of 
alphabetic  and  numeric,  but  should  contain  an  alphabetic  as 
its  first  character. 

(2)  3 lock  Size. 

Self  explanatory. 


a 


I 

I 


A 

I 


(3)  Access  Method  . 

This  may  be  encoded  as: 

£  -  Sequential 

I  -  Indexed  Sequential 
E  -  Direct  Access 
V  -  Virtual  Storage  (VS AM) 

(4 )  Logical  Record  Size. 

This  is  equal  to  record  length. 

(5)  Physical  Storage  Device. 

This  may  be  coded  as  the  following: 

TAPE  -  Magnetic  Tape 

DISK  -  Magnetic  Disk 

DROM  -  Magnetic  Drum 

FICP  -  Floppy  Disk  /  Diskette 

CARD  -  Punched  Card 

PAPE  -  Paper  Tape 

(6)  Fixed  /_  Variable  Code. 

The  codes  used  are: 

FIX  -  Fixed  Length  Record 
VAP  -  Variable  Length  Record 

(7)  Key. 

This  is  an  entity  name  used  as  the  key  for 
both  storing  and  accessing  the  relation.  If  the  relation 
uses  composite  keys,  this  will  be  the  first  part  of  the 
composite  keys  where  the  second  part  will  be  stored  as 
composite-key  attribute. 

(8)  Composite  Key. 

This  is  the  entity  name  used  as  the  second 
part  of  composite  key.  If  a  primary  key  is  not  composite, 
this  attribute  will  be  filled  with  "wONE". 


(9)  Secondary  Key. 

This  is  the  entity  name  used  as  the  secon¬ 
dary  key.  If  the  record  has  no  secondary  key,  this 
attribute  will  be  filled  with  "NONE". 


(10)  Tiae. 


This 

attribute  cental 

ns  a 

descr 

irt 

i  or. 

about  how 

often 

this  rec 

ord  will  be  updat 

ed.  T 

his  wil 

1  l 

d 

number  of 

days . 

(11) 

Updating  Mode. 

This  a 

ttribute  contains 

desc 

rip*  tion 

ab 

out 

how  the 

updating 

is  dene 

.  It  is  encoded  as  foil 

ows: 

BATCH 

Batch  Processi 

ng 

ONUS 

Online  Process 

ing 

BOTH 

3oth  of  Batch 

and  On 

lin  e 

(12) 

Entity 

Length. 

This  denotes  the  length 

of  a 

n  entit 

y  ( 

u.dV 

be  record. 

or 

element) 

.  In  the  case 

of  var 

iable  r 

eco 

rd. 

this  info 

rmatio 

n  will 

be  filled  with 

zeroes 

,  sine 

a 

the 

length  of 

each 

record 

will  be  attached 

wit  hi 

n  the 

rec 

ord 

itself. 

(13) 

En  tity 

Code. 

This  is  a  code  for  the 

charac 

ter  type: 

A 

Alphabetic 

N 

Numeric 

AN 

Aphanumeric. 

(14) 

Source 

• 

This 

denotes  the  org 

anizat 

ional 

e  r.t 

ity 

responsible  for 

pr evid 

ing,  updating. 

ana 

deletin 

3 

the 

entity. 

(15) 

Osers. 

This 

denotes  the  org 

anizat 

ional 

ent 

ity 

(entities) 

allowed  to 

retrieve  and  us 

e  t  he 

entity 

• 

If 

subschemas 

are  a 

pplied  t 

o  the  DD,  this  at 

tribute  will 

net 

b  e 

necessary. 

(16)  De  f inition . 

This  provides  a  detailed  narrative 
describing  the  entity.  This  may  include  the  information 
about  f reguency_of_update,  range_of_acceptable  values,  and 


so  on 


At  the  present  time,  every  entity  r.aie 
(especially  element  entity)  has  no  alias  attribute  attache! 
to.  In  the  future,  this  attribute  sure  will  be  needed.  One 
way  to  accomodate  this  need  is  by  add  another  relation 
called  alias  relation  in  which  has  at  least  two  attributes: 
entity-name  and  its  alias-name. 

h.  Process  Entities 

The  information  included  as  attributes  in  the 
process  entity  are:  name,  input_entity ,  output_entity ,  and 

the  description  of  the  process.  These  can  be  represented  as 
the  relation  of: 

PROCESS  (process_name, input  entity, output_entityf description^ 

of  process, aescr  of_input, input  media, 
descr  of  output, output  media)  ~ 

- key - 

But,  since  a  process  may  have  more  than  one  of  either  input 
or  output  entity,  this  relation  should  have  a  composite  keys 
rather  than  having  only  the  name  as  its  single  key.  Eecause 
only  one  input/output  entity  is  allowed  in  every  instance  of 
PROCESS  [Ref.  6],  processes  having  more  than  one  input/ 
output  entity  may  waste  storage  since  all  attributes  will 
appear  unnecessarily  more  than  once.  In  the  case  that  this 
relation  has  a  composite  key,  a  query  asking  which  data 
entities  are  input  (or  output)  to  a  given  process  also  poss¬ 
esses  a  difficulty  since  this  relation  can  not  be  retrieved 
using  only  the  process_name  (it  must  be  retrieved  using  its 
composite  key,  instead).  In  order  to  make  a  better  payoff, 
those  attributes  may  be  arranged  using  the  following  three 
relations: 

PROCESS  (process  name, description) 

- key - 

PR OCESS_ INPOT (process_name ,input_entity, description. 


(1 )  Process  Name. 

Process  name  will  be  limited  10  maximum  of 
eight  characters  as  applied  to  data  entities. 

(2)  Description  of  Process. 

Self  explanatory. 

(3)  Input  Data. 

The  input  data  may  be  either  a  file, 
record,  group  of  elements,  data  element,  or  data  entered 
from  console. 

(4)  Description  oj  Input. 

This  attribute  may  be  filled  with  a 
description  such  as  the  input  data  is  "sorted  tv  BANK"  for 
instance. 

(5)  Input;  Media. 

As  to  physical  storage  device  attribute, 
this  attribute  may  be  filled  with  input  storage  device,  or 
data  entered  via  console. 

TAPE  -  Magnetic  Tape 

DISK  -  Magnetic  Disk 

DEEM  -  Magnetic  Drum 

FLCP  -  Floppy  Disk  /  Diskette 

CAED  -  Punched  Card 

PAPE  -  Paper  Tape 

CCNS  -  Data  Entered  via  Console 

(6  )  Output  Data . 

The  output  data  may  be  ma gnet ic- data , 
displayed  data,  or  printed  material. 

(7)  Descript  ion  of  Output. 

This  attribute  may  be  filled  with  a 
description  such  as  the  output  data  is  "sorted  by  KOTAMA" 
for  instance. 


(3)  Input  Media. 

This  attribute  may  be  filled  with  output 
storage  device  or  printed  output  material  and  these  are 
encoded  as  follows: 


TAPE  -  Magnetic  Tape 

DISK  -  Magnetic  Disk 

DEUM  -  Magnetic  Drum 

F1CP  -  Floppy  Disk  /  Diskette 

C ABB  -  Punched  Card 

PAEE  -  Paper  Tape 

PEIN  -  Printed  Material 

c.  Osage  Entities 

The  information  pertinent  to  these  entities  have 
teen  included  and  can  be  derived  from  data  entities  in  terms 
of  who  is  responsible  for  update  operations  and  who  is 
allowed  to  retrieve  a  data  entity.  Here,  this  information 
will  be  stated  again  from  the  reverse  point  of  view,  that 
is,  which  data  are  the  responsibility  of  this  entity,  and 
which  cata  are  allowed  to  be  retrieved  by  this  entity. 
Information  that  will  be  included  as  the  usage  entity's 
attributes  are:  user_name,  description  of  the  user, 

ent ity_name,  and  type_of_access.  For  reasons  similar  to 
those  discussed  concerning  the  process  entities,  since  a 
given  user  may  have  more  than  one  data-entity  as  its  respo- 
sibility  and/or  to  be  retrieved  to,  these  attributes  may  be 
represented  by  the  following  three  relations: 

USER  (user  name, description) 

—  key  — 

DSER_ACCESS (user  name, entity  name, type  of_access) 

- - - key - - -  ~  “ 

OSER_RESPONSI3ILITI (user  name, entity_name) 

e 


( 1 )  user  Naae. 

As  with  the  data  entity  and  process  entity 
names,  the  usage  entity  name  will  be  limited  to  a  maximum  of 
eight  characters,  toe. 

(2)  Entity  Name. 

Entity_name  here  means  a  data_entity_ name. 

(3)  User  Responsibility. 

This  may  be  a  list  of  file,  record,  group 
of  element,  or  data  element  entities  that  are  this  user's 
responsibility.  This  can  be  viewed  as  subschema. 

(4)  User  Access. 

This  may  be  a  list  of  file,  record,  group 
of  element,  or  data  element  entities  that  may  be  retrieve  by 
this  user.  This  can  be  viewed  as  subschema,  and  may  cr  may 
not  be  same  as  the  list  of  user  responsibility  items. 

(5)  Type  of  Access. 

The  type_of_access  is  either: 

B  -  Read  only 
0  -  CJpdate  only 

E  -  Both  P.  and  U 
N  -  No  access 

(6)  Description  of  Entity. 

Self  explanatory. 

c.  Summary  cf  Relations 


:able  VIII 


s  ummariz  es 


the  relations 


relationships,  data  entities,  process  entities,  and  usage 


entities. 


Example  cf  Relations 


Table  IX  presents  examples  of  file,  record,  and 
element  entity  relations.  Examples  of  process  entity  rela¬ 
tions  and  user  entity  relations  are  presented  as  Table  X. 
Finally,  an  instance  of  the  relationship  relation  is  shewn 
in  Table  XI. 


TABLE  VIII 

Relational  Model  for  DISPOLLAHTAD  Dictionary  System 
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4 .  Example  of  Data  Diet  ionary * s  Queries 

beta-data  contained  within  data  dictionary  may  be 
used  to  answer  questions  asked  by  top  managers,  users,  or 
technical  staff  (such  as  system  analysts,  programmers,  and 
operators).  In  the  following,  several  queries  and  the 
corres jonding  responses  will  be  presented. 

a.  Top  Management  Queries 

Top  management  may  ask  a  question  like:  "How 
often  is  the  personnel  masterfile  updated  ?" 

Possible  answer  is: 

PEBSFILE  is  updated  once  every  30  days. 

t.  User  Queries 

The  user  responsible  for  personnel  management 
may  ask  the  following  question:  "I  need  a  list  of  personnel 
having  rank  of  captain  who  speak  French  fluently  and  are 
experienced  in  the  intelligence  field.  Is  DI5PULL AHTAD  able 
to  provide  these  data  ?". 

Possible  answer  is: 

See  entities: 

1.  PAHGKAT  (rank)  in  relation  PERSINTL 

2.  ASING  (foreign  language)  in  relation  PEBSINT1 

3.  AKPASIHG  (active/passive  code)  in  relation  PEESINTL 

c.  Technical  Staff  Queries 

"What  are  the  inputs  and/or  outputs  of  process 
DP? S 1 7  ?",  is  one  possible  question  asked  by  an  operator  for 
instance. 

Possible  answer  is: 

Process  DPPS17 

Input  is/are: 

1.  DAPOKDPP  (sorted  by  DPP  ),  media:  TAPE 


Output  is/are: 


1.  D APOKDPP  (sorted  by  KOTAMA  )#  media:  TAPE 

2.  KEKPANGN  (sorted  by  KOTAMA  ),  media:  DISK 

d.  Unanticipated  Queries 

One  of  the  advantages  of  designing  th 
dictionary  using  the  relational  model  is  it  can  accommcdat 
unanticipated  queries.  As  one  of  the  DBMS's  application 
this  dictionary  supports  any  query  expressible  in 
relational  query  language  (e.g.  SEQUEL). 


V.  THE  IMPLEMENTATION  OF  DATABASE  AT  D IS PULI A HT AD 


A.  DATA  DICTIONARY  DESIGN  AS  A  STEPPING-STONE 

The  implementation  of  a  database  at  DISPULLAHTAD  may 
take  advantage  of  the  design  of  Data  Dictionary  in  the 
preceding  chapter  in  many  ways: 

•  The  designed  DC  may  be  used  as  the  first  D3HS  applica¬ 
tion.  Then,  the  experience  gained  here  can  be  used  in  the 
future  application  of  the  real  database  implementation. 

•  DD  as  a  repository  of  all  meta-data  will  provide  a 
full  specification  and  description  of  all  entities  and  rela¬ 
tionships  between  them.  Given  this,  the  implementation  of 
DBMS  will  be  faster  due  to  fewer  changes  and  iterations  in 
database  development. 

•  In  the  case  where  an  automatic  database  design  tool  is 
used,  such  as  DATA  DESIGNER,  it  may  be  interfaced  vith  the 
CDS  in  order  to  take  an  advantage  of  the  CDS  contents.  The 
database  designer  may  benefit  from  the  full  descriptions  of 
each  entity  contained  within  the  data  dictionary. 

In  this  regard,  the  design  of  DD  for  current 
DISPULLAHTAD  applications  can  be  considered  as  a  stepping- 
stone  to  the  DBMS  i nplementation. 

B.  DISPULLAHTAD*  S  DATABASE  DESIGN 

From  Tables  II  through  V,  it  can  be  seen  that  there  are 
many  data  element  redundancies  within  the  four  applications. 
These  redundancies  Eeed  to  be  eliminated  by  designing  a 
database  fulfilling  the  five  normal  forms.  One  possible 
method  is  by  gathering  the  common  data  elements  in  one  rela¬ 
tion,  and  other  specific  data  pertinent  to  each  of  the  four 
applications  in  separate  relations.  The  specific  data  model 
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for  each  application  must  likely  consist  of  more  thar.  cne 
relation. 

In  designing  the  database,  availability  of  data  descrip¬ 
tion  may  be  exploited  to  make  this  work  easier.  For  example, 
suppose  the  following  relationship  is  designed  in  the 
personnel  database: 


MAIN PEES  {nopers,nana, pang kat, corps, jafcatan, satminkl) 
— key- 


Here,  the  possible  descriptions  contained  within  the 
dictionary  that  may  be  extracted  are: 

•  Which  files  and  records  would  have  to  be  accessed  in 
order  to  establish  an  instance  of  this  relation  ? 

•  What  is/are  tte  key/composite  keys  of  each  records 
derived  from  the  preceding  query  ? 

•  What  is  the  length  of  each  of  those  entities  ? 

Furthermore,  in  order  to  satisfy  the  five  normal  feres  a 

full  and  clear  description  of  each  entity  is  needed.  These 
descriptions  are  contained  within  the  data  dictionary.  The 
issue  of  actual  database  design  is  beyond  the  scope  of  this 
thesis  and  is  left  for  possible  follow-on  thesis  work. 


C.  CHOOSING  THE  DATA  DICTIONABY  SYSTEM  (DDS) 
1 .  Features 


The  available  commercial  DDS  have  most  of  the 
following  features  (see  Figure  5.2  for  more  detail). 

a.  Dictionary  Schema 


This  is  a  feature  used  to  generate  a 
er’s  standard  schema,  such  as  entities, 
attributes. 


manufactur- 
an  d 
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relationships. 


b.  Schema  Extensibility 


This  is  a  feature  whereby  an  installation  is 
able  to  customize  the  manufacturer’s  standard  schema  by 
adding  to  it  new  entity-t yp es ,  relationship-types,  an  3 
attribute- types. 


c.  Dictionary  Maintenance 


to  create 


This  is  a 
and  maintain 


feature  that  enables 
its  data  dictionary. 


an  installation 


d.  Heports  and  Queries 


An  installation  may  generate  reports  using  meta¬ 
data  contained  within  the  data  dictionary.  A  DDS  provides 
these  abilities  via  these  features. 


e.  Bridge/Interface  Facility 


This  feature  generates  descriptions  from  the 
data  dictionary  needed  by  other  systems,  typically  an 
application  development  tool  such  as  DATA  DESIGNER. 


f.  Program  Access  Facility 


This  enables  an  installation  to  extend  the  func¬ 
tionality  of  a  DDS,  i.e.,  the  preparation  of  programs  able 
to  access  data  dictionary  contents. 

g.  Status  Facility 

This  feature  provides  the  status  of  each  entity, 
especially  when  the  data  dictionary  is  used  in  the  system 
life  cycle,  by  providing  aids  in  application  development. 

h.  Security  Facility 


An  installation  may  restrict  access  to  the  data 
dictionary  to  authorized  personnel  only.  This  requirement  is 
fulfilled  by  this  feature. 
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2. 


PIS PULL AHTAD  ♦  £  DD 5  Eec  uiremen  ts 

a.  Data  Dictionary  Features 

There  are  five  data  dictionary  features  required 
in  order  tc  implement  DISPULLAHTAD ' s  data  dictionary.  These 
are : 

•  Dictionary  Schema  which  is  needed  ir.  order  to 
generate  the  entities,  relationships,  and  attributes. 

•  Schema  Extensibility  which  will  be  needed  to 
accommodate  specific  needs  that  cannot  be  fulfilled  by  the 
dictionary  schema. 

•  Dictionary  Maintenance  which  is  used  tc  create 
and  maintain  the  data  dictionary. 

•  Beports  and  Queries  which  are  required  in 
order  to  generate  reports  from  and  to  extract  data  contained 
within  data  dictionary. 

•  Security  Facility,  even  though  the  data 
dictionary  has  no  actual  instances  of  "secured”  data,  it 
will  contain  information  about  such  data  that  can  be  used  to 
access  it,  i.e.,  entity_name,  access_method,  where  such  data 
are  stored,  etc.  Therefore,  a  security  facility  is  reguxred 
to  add  and  strengten  the  level  of  security. 

Other  features  are  optional.  These  can  be 
considered  as  "nice  tc  have". 

b.  Active  Versus  Passive  Data  Dictionary 

A  full-active  or  partially-acti ve  system  is  very 
desirable  because  it  provides  features  such  as  enforcing 
standards,  range_of_ value  auditing,  transaction  monitoring, 
etc.  On  the  other  hand,  an  active  system  possesses  much 
overhead,  suffers  in  terms  of  longer  turn-around  time,  and 
requires  more  complex  processing  algorithms.  Therefore,  at 
this  time,  a  passive  data  dictionary  system  is  more  appro¬ 
priate  for  DISPOLL AHTAD.  In  the  future,  after  enough 


experience  has  been  gained,  a  partial-active  system  aav  be 
applied  in  order  to  take  fuller  advantage  of  the  data 
diet  ionary . 

c.  Free-standing  Versus  D3MS-dependent  System 

A  free-standing  system  is  very  appropriate  for 
an  installation  having  different  D3MSs.  This  may  happen  in 
an  installation  with  databases  using  network  or  hierarchical 
structures  in  conjuction  with  newer  technology  such  as  the 
relational  system.  In  this  case,  all  systems  may  access  the 
data  dictionary  independently,  since  the  usage  of  the  lata 
dictionary  is  not  limited  to  any  one  system. 

On  the  ether  hand,  a  data  dictionary  may  be 
implemented  as  one  of  the  DBMS  applications  {some  DDSs  are 
implemented  in  this  manner)  [Ref.  8].  This  approach  is 
appropriate  for  an  installation  implementing  databases  using 
a  single  DBMS,  and  DISPOLLAHTAD  falls  into  this  category. 
Therefore,  a  DBHS-dependent  system  will  provide  more  advan¬ 
tages  for  DISPOLLAHTAD,  e.g.,  it  can  be  used  as  a  training 
tool  in  implementing  the  database.  Another  possible  advan¬ 
tage  is  that  if  DISPOLLAHTAD  should  change  from  a  passive  to 
an  active  system,  there  will  not  be  too  many  modifications 
required  because  the  data  dictionary  and  the  databases  are 
already  compatible. 

d.  Make  or  Euy 

Buying  an  available  commercial  system  may 
provide  a  high  quality  and  ready-to-use  system.  Figure  5.2 
summarizes  features  of  current  commercial  systems  that  may 
be  used  in  choosing  the  best  system  fulfilling  the  required 
features.  A  commercial  system  used  by  many  installations 
without  much  trouble  may  be  an  indication  that  it  has  a 
certain  quality.  Criteria  listed  in  Figure  5.1  may  be  used 
in  selecting  the  best  system  for  DISPOLLAHTAD. 


1.  It  should  have  at  least  the  following  five  features 
that  may  be  considered  as  the  primary  criteria: 

a.  Dictionary  Schema. 

b.  Schema  Extensibility. 

c.  Dictionary  Maintenance. 

d.  Reports  and  Queries. 

e.  Security  facility. 

2.  The  following  fcur  criteria  may  be  considered  as 
the  secondary  criteria: 

a.  Compatible  with  current  hardware. 

b.  Compatible  with  applied  DBMS. 

c.  Hign  quality  assurance. 

d.  low  Acquisition  and  Set-up  Costs. 


Figure  5.1  Criteria  for  choosing 
Commercial  Data  Dictionary  System. 

On  the  other  hand,  one  significant  advantage  of 
designing  a  DDS  in-house  is  that  the  system  can  be  fitted  to 
specific  requirement  of  the  installation.  Furthermore, 
given  that  DISl’iJLLAHTAD  will  implement  a  DBMS,  the  rela¬ 
tional  dictionary  presented  in  the  previous  chapter  may  be  a 
good  candidate  for  the  first  application.  By  implementing 
the  dictionary  as  a  relational  database,  only  one  system 
needs  to  be  acquired  (DBMS)  instead  of  two  (DBMS  and  DDS)  . 

3 .  Recommendation 

For  the  reasons  discussed  in  the  previous  section, 
it  is  tetter  for  DISPULLAHTAD  to  implement  the  data 
dictionary  model  described  in  the  previous  chapter  as  the 
first  application  of  the  DBMS.  The  dictionary  will  be  a 
DBMS-dependent  system.  Initially,  the  system  should  be 
passive,  and  later,  if  appropriate,  it  may  be  changed  to  an 
active  system. 
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gure  5.2  Features  of  Current  Commercial  DDS 
(summarized  from  Lefkovits  et  al  [Ref.  1]). 


VI.  CONCLUSION 


I  mplementing  a  DBMS  is  a  ’’mast"  for  DISPULL  AHI A  E  i  r. 
order  to  control  the  proliferation  of  its  applications  that 
in  turn  raises  problems  of  data  redundancy  and  data  incon¬ 
sistency.  Primarily,  a  D3MS  provides  data  manipulation 
capabilities  whereas  a  data  dictionary  provides  management 
and  control.  Applying  management  and  control  (by  imple¬ 
menting  EDS)  first  will  make  the  job  of  database  design  and 
implementation  easier  in  term  of  lessening  the  difficulties 
and  the  time  and  effort  required  to  develop  databases.  In 
this  regard,  designing  a  data  dictionary  may  be  considers  1 
as  a  stepping-stone  to  the  implementation  of  a  DBMS. 

This  thesis  has  presented  a  relational  model  of  a 
dictionary  which  can  satisfy  the  needs  of  DISPUL LAHTAE.  The 
advantages  of  this  model  are: 

1)  it  is  compatible  with  any  relational  DBMS  that 
DISPOIL AHTAD  may  procure. 

2)  it  obviates  the  need  to  buy  a  DDS  in  addition  tc  the 
DBMS. 

3)  it  can  be  tailored  specifically  to  DISPULLAHTAE* s 
needs  because  of  the  flexibility  of  the  relational  model. 

4)  it  satisfies  the  criteria  for  a  DDS  (see  figure  5.1) 

In  order  to  attain  the  objective  of  managing  and 

controlling  data  resources,  the  following  implementation 
policy  is  recommended: 

•  All  personnel  involved  in  software  development  (such 
as  system  analysts,  programmers,  etc.)  should  use  the  data 
dictionary  extensively  in  doing  their  jobs. 

•  Only  the  data  administrator  staff  may  update  the  data 
dictionary,  others  may  access  it  in  read-only  mode. 


•  All  suggestions  concerning  the  lata  dictio 
addressed  to  the  data  administrator  staff. 

This  thesis  has  stopped  short  of  suggesting 
design  fcr  DIS PU LL AHTAD’ s  personnel  application, 
vork  could  be  done  using  the  dictionary  model 
herein  as  a  foundation. 
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